We have two Windows 2008 R2 servers with System Center Operations Manager 2012 R2 RU2 installed. From time to time, several dozen (but not all) servers are greyed out in Operations Managers and the server itself tells you it can't connect to the management server:
The OpsMgr Connector could not connect to RMSSERVER.domain.com:5723. The error code is 10060L(A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.). Please verify there is network connectivity, the server is running and has registered it's listening port, and there are no firewalls blocking traffic to the destination.
The event is correct because even a telnet to port 5723 does not give any response! When we restart the System Data Access Service everything is working fine again for a couple of days (e.g. 8 days).
We have noticed the issue on both servers at the same time, but also on one of the two servers.
There is no firewall between the OpsMgr servers and the clients and there is no security software running on the OpsMgr management servers. I have checked the SPN records but those are all there and looking good:
C:\Users\admin>setspn -l opsmgr1Registered ServicePrincipalNames for CN=opsmgr1,OU=System Center,OU=Servers,OU=IT Management,DC=company,DC=local:
MSOMSdkSvc/opsmgr1.company.local
MSOMSdkSvc/opsmgr1
MSOMHSvc/opsmgr1.company.local
MSOMHSvc/opsmgr1
TERMSRV/opsmgr1.company.local
WSMAN/opsmgr1.company.local
RestrictedKrbHost/opsmgr1.company.local
HOST/opsmgr1.company.local
TERMSRV/opsmgr1
WSMAN/opsmgr1
RestrictedKrbHost/opsmgr1
HOST/opsmgr1
I noticed the issue is there from the beginning so i can not relate it to a particular change.
Anyone have good suggestions other then creating a call with Microsoft :).