While checking general health of our SCOM 2007 R2 database server (which also houses datawarehouse) I found in the operations manager log we have a constant repeating pattern of:
Event 1003
Summary: 849 rule(s)/monitor(s) failed and got unloaded, 0 of them reached the failure limit that prevents automatic reload........
then immediately:
Event 4000
A monitoring host is unresponsive or has crashed. The status code for the host failure was 2164195371.
I did some research and found recommendations to install a hotfix, but that was designed for scom 2007 before R2.
The errors repeat in a sporadic manner. Sometimes as little as 45 minutes between, sometimes up to 12 hours.
Please let me know how I can diagnose and repair. Thank you!