To diagnose unstable hybrid mail flow in your Exchange Hybrid deployment, you can follow these break-fix steps:
- Check Receive Connectors: Ensure that the Receive connectors on your on-premises Exchange servers are configured correctly. First, list the servers that the hybrid configuration uses to receive mail from Exchange Online: `(Get-HybridConfiguration).ReceivingTransportServers`. Then, for each server returned, verify that its connectors are listening on TCP port 25 for both IPv4 and IPv6: `Get-ReceiveConnector -Server "<Server Name>" | Format-Table Identity,Bindings`. For the connector that receives mail from Exchange Online, the expected output for Bindings is `{[::]:25, 0.0.0.0:25}`.
- Monitor Transport Services: Since you mentioned that mail flow resumes after restarting the on-prem Exchange Transport service, monitor the service for any errors or performance issues. Check the event logs for any related errors that might indicate why the service fails intermittently.
- Validate Certificates: Ensure that the certificates used for secure mail transport are valid and correctly installed on all relevant Exchange servers. They should be issued by a commercial certification authority, not expired, and assigned to the necessary services (IIS and SMTP).
- Test Connectivity: Use the Microsoft Remote Connectivity Analyzer to test inbound Internet connectivity to your on-premises Exchange servers. This can help identify issues with Autodiscover and Exchange Web Services (EWS) endpoints:
  - Run tests for Synchronization, Notification, Availability, and Automatic Replies.
  - Run the Inbound SMTP Email test to verify SMTP connectivity.
- Review Message Tracking Logs: Since you've already checked the message tracking logs, continue to monitor them for patterns or specific errors that occur when the mail flow fails. This can provide insights into whether the issue is with specific senders, recipients, or times.
- Check Network Configuration: Ensure that your firewall and network configurations allow for proper communication between the on-premises Exchange servers and Exchange Online. Verify that there are no intermittent network issues that could affect mail flow.
- Re-run Hybrid Configuration Wizard: If you suspect configuration issues, consider re-running the Hybrid Configuration Wizard to ensure that all settings are correctly applied and up to date.
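
The transport service, certificate, and configuration checks in the list above can be sketched in the Exchange Management Shell roughly as follows. Treat this as a minimal sketch, not a definitive script: `EXCH01` is a placeholder server name, and the 24-hour look-back window is an arbitrary assumption that you should adjust to match the times when mail flow stopped.

```powershell
# Assumptions: run from the on-premises Exchange Management Shell.
# "EXCH01" is a placeholder; substitute a server returned by
# (Get-HybridConfiguration).ReceivingTransportServers.
$server = "EXCH01"
$since  = (Get-Date).AddHours(-24)   # arbitrary look-back window

# State of the Transport and Front End Transport services.
Get-Service -ComputerName $server -Name MSExchangeTransport, MSExchangeFrontEndTransport |
    Format-Table Name, DisplayName, Status -AutoSize

# Recent errors and warnings logged by the Transport service.
Get-WinEvent -ComputerName $server -ErrorAction SilentlyContinue -FilterHashtable @{
    LogName      = 'Application'
    ProviderName = 'MSExchangeTransport'
    Level        = 2, 3              # 2 = Error, 3 = Warning
    StartTime    = $since
} | Select-Object TimeCreated, Id, LevelDisplayName, Message -First 50

# Queues that hold messages or report a last error.
Get-Queue -Server $server |
    Where-Object { $_.MessageCount -gt 0 -or $_.LastError } |
    Format-Table Identity, DeliveryType, Status, MessageCount, LastError -AutoSize

# Certificates: look for expired or invalid entries, self-signed issuers,
# and certificates that are not assigned to the SMTP and IIS services.
Get-ExchangeCertificate -Server $server |
    Select-Object Subject, Issuer, Services, Status, NotAfter, IsSelfSigned, Thumbprint |
    Format-List

# Current hybrid settings, for comparison before re-running the wizard.
Get-HybridConfiguration | Format-List
```

Capturing this output while mail flow is healthy and again during a failure gives you two snapshots to compare, which may narrow down whether the service, the queues, or a certificate changes state when delivery stops.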
By systematically checking these components, you should be able to identify the root cause of the unstable hybrid mail flow and take corrective action.
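
For the message tracking and network steps, a sketch along these lines may help show whether failures cluster at particular times and whether TCP port 25 is reachable in the outbound direction. The server name `EXCH01`, the seven-day window, and the host `contoso-com.mail.protection.outlook.com` are all placeholders, not values taken from your environment; substitute your own server and your tenant's Exchange Online Protection endpoint.

```powershell
# Assumptions: run from the on-premises Exchange Management Shell.
$server = "EXCH01"                    # placeholder server name
$start  = (Get-Date).AddDays(-7)      # arbitrary look-back window

# Count FAIL and DEFER events per hour to see whether failures cluster in time.
Get-MessageTrackingLog -Server $server -Start $start -End (Get-Date) -ResultSize Unlimited |
    Where-Object { $_.EventId -in 'FAIL', 'DEFER' } |
    Group-Object { $_.Timestamp.ToString('yyyy-MM-dd HH:00') }, EventId |
    Sort-Object Name |
    Format-Table Name, Count -AutoSize

# Outbound reachability of Exchange Online Protection on TCP port 25.
# The host name below is a placeholder for your tenant's endpoint.
Test-NetConnection -ComputerName "contoso-com.mail.protection.outlook.com" -Port 25 |
    Select-Object ComputerName, RemoteAddress, RemotePort, TcpTestSucceeded
```

If the hourly counts spike at the same times that the connectivity test reports `TcpTestSucceeded` as `False`, the network path is the more likely cause. If connectivity stays healthy while failures accumulate, the transport service itself deserves closer attention.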