Quantcast
Channel: Operations Manager - General forum
Viewing all 11941 articles
Browse latest View live

Cluster Disks Monitoring

$
0
0

Hello,

The Cluster Disks Monitoring is not refreshing properly on time... it takes days to get the monitor reset after correction of the issue (space %,  Space MB).

Even when running a Reset Health on the Health Explorer there is no return ...it seems just not doing anything... later 1-2 days after I will see a new alert coming...

- Delay on 50% of the Cluster Disks

- No return at all on 15%

Any idea what is wrong?

Thanks,

Dom


System Center Operations Manager 2007 / System Center Configuration Manager 2007 R2 / Forefront Client Security / Forefront Identity Manager


SCOM 2012 R2 Network Monitoring error

$
0
0

Hi Gurus

I am having a weird issue with SCOM network monitoring. I previously discovered and monitored network devices using recursive network discovery. Today I suddenly discover that all my network devices under Administration-->Network Management-->Network devices are all gone!!! So I use the discovery rule that I previously had, re-ran it on one of the core switches, the previously discovered column displays the same number of network devices as there were previously but the network devices option is still empty. The core switch that I ran the discovery on is now under network device pending management and the error being no response SNMP. If I want to delete that core switch from pending management, I get the error "An object of class MonitoringConnector with ID xxxx was not found". I can also see an internal connector named network devices pending management. I can't see anything else related to network devices under internal connector. How can I resolve this issue and start monitoring the network devices again, I am completely stumped!!!

As an idea, if I delete the internal connector, try deleting the device pending management, delete the discovery rule and try rediscover the network device, will that work? I am a bit scared that if I delete the internal connector, I cannot recreate it. I also want to know that whether I can delete the network device pending management by running SQL queries on the SCOM Database? 

Please help, any assistance will be greatly appreciated.

Regards,

Dipan


Removing discovered server from SQL Computers Group

$
0
0

Hello,

I am trying to get 2 specific servers to not populate under the "SQL Computers" group. Specifically in the Microsoft SQL Server > Computers view in the SQL MP

I have created a group containing the 2 computers and set it as the target for the override on discovery "Discover SQL Server Computer Group membership"

I then run the Remove-SCOMDisabledClassInstance cmdlet but the computers remain in this view.

I attempted the same steps to disable the SQL DB Engine discovery for these 2 servers and it worked fine

Any ideas?

SCOM Side By Side Migration from 2007 to 2012

$
0
0

We current have SCOM 2007R2 (6.1.7221) installed and we plan to install SCOM 2012 R2 (latest release) in a side-by-side migration.  If we deploy the SCOM 2012R2 Client to existing systems being monitored, can we multi-home the clients to both SCOM 2007R2 and SCOM 2012R2? Are there any version requirements or issues that we could face during the migration since we on (6.1.7221)?

Again, this is a side by side migration path and not an upgrade to the SCOM 2007 servers. :) I see other forum topics almost just like this, but don't see any supporting documentation.

Event 2115 and 31551 generated

$
0
0

Hi All

I know there is much information relating to the two event IDs above, mainly relating to performance issues.

Essentially, I am seeing the above Event ID for the following Workflows:

  • Microsoft.SystemCenter.DataWarehouse.CollectEventData
  • Microsoft.SystemCenter.DataWarehouse.CollectPerformanceData
  • Microsoft.SystemCenter.DataWarehouse.Synchronization.ManagedEntity
  • Microsoft.SystemCenter.DataWarehouse.Synchronization.DomainSnapshot 

There are others, but these are the main ones.  However it is the content of the message that is interesting.  E.g.

Log Name:      Operations Manager
Source:        Health Service Modules
Date:          22/01/2014 13:14:50
Event ID:      31551
Task Category: Data Warehouse
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      <SERVER>
Description:
Failed to store data in the Data Warehouse. The operation will be retried.
Exception 'SqlException': Sql execution failed. Error 6602, Level 16, State 2, Procedure sp_xml_preparedocument, Line 1, Message: The error description is 'End tag 'ResolutionStateId' does not match the start tag 'nStateId'.'.
The XML parse error 0xc00ce56d occurred on line number 1, near the XML text "<Root><Item><DomainName>ResolutionState</DomainName><DomainContentXml><Root><ResolutionState><ResolutionStateId>254</ResolutionStateId><ResolutionStateGuid>497186A1-5606-4B0C-91C1-30BA37232A82</ResolutionStateGuid><ResolutionStateName>Resolved</ResolutionStateName><PredefinedInd>1</PredefinedInd><CreatedDateTime>2013-10-09T16:43:58.353</CreatedDateTime><LastModifiedDateTime>2013-10-09T16:43:58.353</LastModifiedDateTime></ResolutionState><ResolutionState><ResolutionStateId>247</ResolutionStateId><ResolutionStateGuid>F791ED45-DCB4-475F-8A36-539A9CA8E495</ResolutionStateGuid><ResolutionStateName>Awaiting Evidence</ResolutionStateName><PredefinedInd>1</PredefinedInd><CreatedDateTime>2013-10-09T16:43:58.353</CreatedDateTime><LastModifiedDateTime>2013-10-09T16:43:58.353</LastModifiedDateTime></ResolutionState><ResolutionState><nStateId>254</ResolutionStateId><ResolutionStateGuid>497186A1-5606-4B0C-91C1-30BA37232A82</ResolutionStateGuid><ResolutionStateName>Resolved</ResolutionStateName><PredefinedInd>1</PredefinedI". 

One or more workflows were affected by this.  

Workflow name: Microsoft.SystemCenter.DataWarehouse.Synchronization.DomainSnapshot 
Instance name: eb249cfa-4f62-4bd9-a181-a6641d78ebd9 
Instance ID: {1600518F-0CD5-5385-5C79-AABC665ED81B} 
Management group: <MG>

Log Name:      Operations Manager
Source:        Health Service Modules
Date:          22/01/2014 13:14:43
Event ID:      31551
Task Category: Data Warehouse
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      <SERVER>
Description:
Failed to store data in the Data Warehouse. The operation will be retried.
Exception 'SqlException': XML parsing: line 1, character 108, illegal xml character 

One or more workflows were affected by this.  

Workflow name: Microsoft.SystemCenter.DataWarehouse.CollectEventData 
Instance name: <SERVER>
Instance ID: {4629E455-0F9F-6EF9-4761-6D8BC5F0DCCA} 
Management group: <MG>

Log Name:      Operations Manager
Source:        Health Service Modules
Date:          22/01/2014 13:14:23
Event ID:      31551
Task Category: Data Warehouse
Level:         Error
Keywords:      Classic
User:          N/A
Computer:     <SERVER>
Description:
Failed to store data in the Data Warehouse. The operation will be retried.
Exception 'SqlException': OLE DB provider 'STREAM' for linked server '(null)' returned invalid data for column '[!BulkInsert].DateTime'. 

One or more workflows were affected by this.  

Workflow name: Microsoft.SystemCenter.DataWarehouse.CollectPerformanceData 
Instance name: <SERVER>
Instance ID: {4629E455-0F9F-6EF9-4761-6D8BC5F0DCCA} 
Management group: <MG>

I have looked at the relevent performance stats on the SQL Server hosting the DW Database.  I can confirm that Disk queues are consistently zero, and that idletime never drops below 98%.

It's probably worth noting that the above started happening following the DW database running out of space.  More space was allocate to the disk and the database files were expanded.  However, the above is now occurring.

Anyone have any idea what might be the cause?

Cheers

Shaun

CPU Monitoring

$
0
0

Can CPU can be monitored through System Center and what alerting options are available for monitoring?

How to cancel task which are in Queue state in SCOM console?

$
0
0

Hi All,

When I start any task it does not execute. It show task is in queue state. How can I cancel or execute this task?

Thanks

Agents are still showing Not Monitoring state in SCOM 2012 server

$
0
0

Hi All,

I have put 51 servers in maintenance mode for 300 mins. After 300 mins maintenace mode is over but All agents are still inNot Monitoring state.

Why this happen in SCOM ? What cloud be the work around to fix this issue? Please help me

Thanks


The provider collection was either null or empty when trying to create a subscription.

$
0
0

Hi,

In the Notification Subscription Wizard, when I try to filter on a certain Management Pack I receive following error:

Does anyone know what this is caused by or how to troubleshoot this?

I already tried clearing the console cache to no avail.

Grts.

Edit: The issue probably occured after setting up monitoring for a website using the "Web Application Availability Monitoring" template. I used an existing Mgmt Pack. I tried creating a new Mgmt Pack in the wizard and have the same issue when I filter on this new Mgmt Pack in the Notification Subsciption Wizard.

Operations Manager 2012 R2 Enterprise Alerting

$
0
0

Hi,

at the moment we plan the migration from SCOM 2007 R2 to SCOM 2012 R2.

We are looking for an enterprise alerting solution like the product from DERDACK. Do you know of any other vendors / products with the same capabilities?

Thank you,

Martin

HP Storage Management Pack 3.0

$
0
0

Hello,

Is any one successful in getting diagram view of HP storage devices using HP Storage Management Pack 3.0 for SCOM 2012.  I could successfully send SNMP test traps from storage nodes to SCOM server.  My configuration is Gateway server in an untrusted domain and have trouble connecting  HP Storage Management Pack User Configuration Tool.exeapplication to SCOM server from GW server domain.  It is not clear which port is used by this tool/application to communicate with SCOM server.  Any pointer in this regard is highly appreciated.

Thanks 

SCOM server monitoring

$
0
0

Hi,

Can we monitor a server through SCOM of which we cant take RDP directly but only  by logging in through another server. The server which we want to monitor is also in DMZ .

Thanks 

Really expensive discovery insertion - how can I trace it back?

$
0
0

Hello again folks. 

Weve had some pretty grim Ops DB performance over the last few days causing console slowdowns, los off data, the works.

Our DBA has identified the following query causing the locks for several minutes at a time.

CREATEPROCEDURE [dbo].[p_EntityTransactionLogBegin]

(

    @DiscoverySourceIduniqueidentifier,

    @ContextGeneratednvarchar(255)=NULL,

    @TransactionIdbigint= NULLOUTPUT

)

AS

BEGIN

    SETNOCOUNTON;

    DECLARE @Err int;

    DECLARE @LastModified datetime;


   UPDATE [dbo].[DiscoverySource]

   SET [TimeGeneratedOfLastSnapshot] = [TimeGeneratedOfLastSnapshot]

   WHERE [DiscoverySourceId] ='85AB926D-6E0F-4B36-A951-77CCD4399681'

    SELECT @Err = @@ERROR;

    IF (@Err <> 0) 

       GOTO Error_Exit;

      

    SET @LastModified=GETUTCDATE();

    INSERTINTO dbo.[EntityTransactionLog]

   (

        [DiscoverySourceId],

        [ContextGenerated],

        [LastModified],

        [TimeAdded],

        [IsCommitted]

    )

    VALUES

   (

        @DiscoverySourceId,

        @ContextGenerated,

        @LastModified,

        @LastModified,

        0

    );

    SELECT @Err = @@ERROR;

    SELECT @TransactionId = @@IDENTITY;

    IF (@Err <> 0)

       GOTO Error_Exit;

    IFOBJECT_ID('tempdb..#EntityTransaction')ISNOT NULL

    BEGIN

       INSERT #EntityTransaction

       (TransactionId)

       VALUES

       (@TransactionId);

    END

    RETURN 0;

Error_Exit:

    RETURN 1;

END

GO

This seems to be trying to insert half a million rows a time into the DB - no idea why?

Does anyone know how I can locate that discovery ID in the SCOM DB? Ive tried get-scommonitoring object -id without success.

SCOM 2007 R2 Root Management server showing Not Monitored State in Ops Mgr Console

$
0
0

Hello Experts,

In my Prod SCOM 2007 R2 environment RMS server state is "Not Monitored", But we are receiving alerts with limitation. By mistakenly I put Maintenance Mode while rebooting RMS server due to slow performance of the server.

Can anybody help me to revert back to the RMS Health state ?

Management Groups

$
0
0

Hello,

I need to move several agents to a new management group.

They are actually on the management group SCOM-PROD Management server OPMGRMS1.ad. they need to move to the management group SCOM-TEST management server OTMGRMS1.ad. Both management group are connected on OPMGRRMS1.ad

How could I do this?

Do I need to uninstall all agents and re-install them from the new RMS for SCOM-TEST?

Thanks,
Dom


System Center Operations Manager 2007 / System Center Configuration Manager 2007 R2 / Forefront Client Security / Forefront Identity Manager


Windows 2003 SP1 monitoring through SCOM 2012 R2

$
0
0

Hi All,

is it possible to monitor windows 2003 sp1 servers from scom 2012 r2?

When i am trying to install SCOM 2012 agent(i386), its throwing an error product can be installed from windows 2003 sp3 onwards.

Can any one help me out , how to install agent in windows 2003 sp1 servers. or is there any alternate ways to enable monitor

Discovered Inventory Data Warehouse Synchronization Server

$
0
0

I am getting an error checking this, object reference not set to an instance of an object

Date: 1/23/2014 1:59:46 PM
Application: Operations Manager
Application Version: 7.1.10226.0
Severity: Error
Message: 

System.NullReferenceException: Object reference not set to an instance of an object.
   at Microsoft.EnterpriseManagement.Common.EnterpriseManagementObjectBaseWithProperties.Reconnect(EnterpriseManagementGroup managementGroup)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.GetUpdate(IndexTable indexTable, QueryUpdate`1 update, CacheCursor cursor, Range range, Int32 offset, Int32 groupLevel)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.QueryCache`2.GetUpdate(IndexTable indexTable, QueryUpdate`1 update)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.QueryCache`2.GetUpdate(CacheSession session, Boolean fullUpdate)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.QueryCache`2.FireUpdateEvent(CacheSession session, DateTime updateTime, Boolean dataChanged, Boolean fullUpdate, Boolean updatesOnly, IEnumerable queryResult)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.FireUpdateEvents(CacheSession session, Boolean dataChanged, Boolean fullUpdate, ICollection`1 queryResult)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.PostQuery(CacheSession session, IndexTable indexTable, UpdateReason reason, UpdateType updateType, Boolean dataChanged, DateTime queryTime, ICollection`1 queryResult)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.InternalSyncQuery(CacheSession session, IndexTable indexTable, UpdateReason reason, UpdateType updateType)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.InternalQuery(CacheSession session, UpdateReason reason)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Cache.Query`1.TryDoQuery(UpdateReason reason, CacheSession session)
   at Microsoft.EnterpriseManagement.Mom.Internal.UI.Console.ConsoleJobExceptionHandler.ExecuteJob(IComponent component, EventHandler`1 job, Object sender, ConsoleJobEventArgs args)

Group renaming issue in OpsMgr 2012 R2

$
0
0

Hi,

I was wondering if the group renaming issue in OpsMgr 2012 R2 is already addressed.
When renaming a group in the console it doesn't seem to be updated in the database correctly.

I found out about this when I target a scheduled maintenance job to a group which was renamed. After that the PowerShell script didn't work any more. When checking the group name in PowerShell with Get-SCOMGroup I found the old name showed up.

Thanks in advance,

Marthijn.

Event ID 717 - MSExchangeMonitoringCorrelation - Connection to the RMS has failed.

$
0
0

Hello.  I'm new to SCOM and need some help with errors I am receiving on our OM server.  We have 2 servers: OM and the DB server.  The OM server is hosting all the roles with the exception of the DB server.  The event log shows a few different errors with the same Event ID:717

Event Type: Warning
Event Source: MSExchangeMonitoringCorrelation
Event Category: General
Event ID: 717
Date:  1/24/2011
Time:  11:05:26 AM
User:  N/A
Computer: RFOM1
Description:
Connection with the Operations Manager Root Management Server has failed.

Error message: The connection to the Operations Manager Root Management Server 'localhost' has been disconnected.

Number of occurrence: 1

Retrying in 30 seconds...

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

 

Event Type: Warning
Event Source: MSExchangeMonitoringCorrelation
Event Category: General
Event ID: 717
Date:  1/24/2011
Time:  12:45:32 AM
User:  N/A
Computer: RFOM1
Description:
Connection with the Operations Manager Root Management Server has failed.

Error message: The requested operation timed out.

Number of occurrence: 1

Retrying in 30 seconds...

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

 

and lots of these:

Event Type: Warning
Event Source: MSExchangeMonitoringCorrelation
Event Category: General
Event ID: 717
Date:  1/21/2011
Time:  4:22:50 PM
User:  N/A
Computer: RFOM1
Description:
Connection with the Operations Manager Root Management Server has failed.

Error message: A Management Pack in the Management Group has been added, upgraded, overridden, or deleted.  A reconnection will be needed to detect the changes.

Number of occurrence: 1

Retrying in 30 seconds...

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.

Not sure what my next steps should be.

Thanks,
Dan

 

event 4000 on SCOM DB / DW server

$
0
0

While checking general health of our SCOM 2007 R2 database server (which also houses datawarehouse) I found in the operations manager log we have a constant repeating pattern of:

Event 1003

Summary: 849 rule(s)/monitor(s) failed and got unloaded, 0 of them reached the failure limit that prevents automatic reload........

then immediately:

Event 4000

A monitoring host is unresponsive or has crashed.  The status code for the host failure was 2164195371.

I did some research and found recommendations to install a hotfix, but that was designed for scom 2007 before R2.

The errors repeat in a sporadic manner.  Sometimes as little as 45 minutes between, sometimes up to 12 hours.

Please let me know how I can diagnose and repair. Thank you!


Viewing all 11941 articles
Browse latest View live