Advisor
BrentGee
Posts: 24
Registered: ‎11-01-2013
Message 1 of 3 (205 Views)

CMU 7.1 - Actions and Alerts Issue

Hi Again:

 

I'm trying to gather statistics in the new GUI. On the BL2x220 G6s there is no problem; everything works great. However, on my BL2x220 G5s I can't see any statistics whatsoever. I tried using both the native commands and the collectl commands, and neither returned any data.

 

Is there a way I can troubleshoot this?

Advisor
Rakshika
Posts: 12
Registered: ‎03-12-2012
Message 2 of 3 (169 Views)

Re: CMU 7.1 - Actions and Alerts Issue

Hello,

 

Please check the following:

- Is passwordless ssh enabled from the head node to the problematic nodes?

- Is the CMU monitoring agent (cmu_cn-7.1-1.x86_64.rpm) installed on those nodes?

- Is a firewall running between the head node and the problematic compute nodes?

- Are the monitoring agents running on the problematic nodes?

  Please run this command on the head node:

   # /opt/cmu/bin/pdsh -w <nodelist> ps -ef | grep Monitoring | /opt/cmu/bin/dshbak -c

       <nodelist> should be the host names of the nodes having the problem, e.g. n[1-5]

- Look in these logs to debug the issue:

    Head node: 

         /opt/cmu/log/MainMonitoringDaemon_<head node name>.log

    Node on which the secondary monitoring daemon is running (the above command will show this):

         /opt/cmu/log/SecondaryServerMonitoring_<compute node name>.log

    Problematic compute nodes: 

        /opt/cmu/log/SmallMonitoringDaemon_n1.log
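
A minimal sketch of how the first two checks in this list could be scripted from the head node. The function names are hypothetical and not part of CMU, and the package name cmu_cn is an assumption taken from the RPM file name above. It uses only standard ssh and rpm options.

```shell
#!/bin/sh
# Hypothetical helpers for the checklist above; not shipped with CMU.

# Succeeds only if key-based (passwordless) ssh to the node works.
# BatchMode=yes makes ssh fail instead of prompting for a password.
check_passwordless_ssh() {
    ssh -n -o BatchMode=yes -o ConnectTimeout=5 "$1" true
}

# Succeeds if the monitoring agent package is installed on the node.
# Assumption: the package is named cmu_cn (from cmu_cn-7.1-1.x86_64.rpm).
check_agent_installed() {
    ssh -n -o BatchMode=yes -o ConnectTimeout=5 "$1" rpm -q cmu_cn
}

# Prints one status line per node name given as an argument.
check_nodes() {
    for node in "$@"; do
        if ! check_passwordless_ssh "$node" >/dev/null 2>&1; then
            echo "$node: passwordless ssh FAILED"
        elif ! check_agent_installed "$node" >/dev/null 2>&1; then
            echo "$node: cmu_cn NOT installed"
        else
            echo "$node: ok"
        fi
    done
}
```

For example, `check_nodes n1 n2 n3` prints a status line for each of the three nodes; any node that does not report "ok" is worth looking at before digging into the logs.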

           

If this doesn't solve the problem, please post the results of the above checks here and send us the logs.
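
To make gathering the logs easier, here is a small sketch that prints the three log paths listed above for a given set of host names. The function name is hypothetical, and it assumes the suffix of the SmallMonitoringDaemon log is the compute node's name (as in the n1 example above).

```shell
#!/bin/sh
# Hypothetical helper: print the CMU monitoring log paths to collect.
# Arguments: head node name, secondary-daemon node name, compute node name.
cmu_log_paths() {
    head_node=$1
    secondary_node=$2
    compute_node=$3
    echo "/opt/cmu/log/MainMonitoringDaemon_${head_node}.log"
    echo "/opt/cmu/log/SecondaryServerMonitoring_${secondary_node}.log"
    # Assumption: the log name ends in the compute node's own host name.
    echo "/opt/cmu/log/SmallMonitoringDaemon_${compute_node}.log"
}
```

For example, `cmu_log_paths head1 n2 n1` lists the files to look at for head node head1, secondary-daemon node n2, and problematic node n1 (these host names are placeholders).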

Advisor
BrentGee
Posts: 24
Registered: ‎11-01-2013
Message 3 of 3 (139 Views)

Re: CMU 7.1 - Actions and Alerts Issue


Rakshika wrote:

Hello,

 

Please check the following:

- Is passwordless ssh enabled from the head node to the problematic nodes?

YES

- Is the CMU monitoring agent (cmu_cn-7.1-1.x86_64.rpm) installed on those nodes?

YES

- Is a firewall running between the head node and the problematic compute nodes?

NO

- Are the monitoring agents running on the problematic nodes?

YES

 

 

I will get back to you regarding the following instructions:

 

  Please run this command on the head node:

   # /opt/cmu/bin/pdsh -w <nodelist> ps -ef | grep Monitoring | /opt/cmu/bin/dshbak -c

       <nodelist> should be the host names of the nodes having the problem, e.g. n[1-5]

- Look in these logs to debug the issue:

    Head node: 

         /opt/cmu/log/MainMonitoringDaemon_<head node name>.log

    Node on which the secondary monitoring daemon is running (the above command will show this):

         /opt/cmu/log/SecondaryServerMonitoring_<compute node name>.log

    Problematic compute nodes: 

        /opt/cmu/log/SmallMonitoringDaemon_n1.log

           

If this doesn't solve the problem, please post the results of the above checks here and send us the logs.


Sorry for the delay. However, now that all of the other issues have been resolved, I will be able to concentrate on my final problem with 7.1. Thank you for all of these suggestions.
