10-01-2012 03:58 PM
We run a managed hosting environment and run 25 instances of the Stingray traffic manager. As we continue to add more Stingray traffic managers I’m running into issues monitoring all of them. Currently we have email alerts configured to send out notifications when there are any issues detected. This works well but at times we get bombarded with Stingray emails and if someone misses an email the traffic manger alarm will get overlooked. We also use SNMP alarms that monitor specific OIDs (CPU, disk space, node failures,…) to display ongoing alarms on a dashboard in our NOC. This works well for us so all engineers are notified if there is a traffic manager in alarm. The issue I run into with this is the SNMP monitors do not always catch issues with the traffic managers.
Is there a way to poll the overall status of the traffic mangers via SNMP or API? For example is there a way to run the green, yellow, red status that display in the top right corner of the GUI?
Is there a dashboard in development that I can connect all of our traffic mangers to easily administer them? In our case multi-site manager will not work because we have to keep each traffic manager independent of each other.
Has anyone found a nice method for monitoring the traffic managers? Here are a few SNMP values I’ve been using:
CPU percentage: .184.108.40.206.4.1.2021.10.1.3.1
IP forward is enabled: .220.127.116.11.18.104.22.168.0 (we have to use IP forwarding and the traffic mangers likes to disable this after updates)
Ram Free: .22.214.171.124.4.1.2021.4.11.0
Disk space used in Root: .126.96.36.199.188.8.131.52.184.108.40.206 / .220.127.116.11.18.104.22.168.22.214.171.124 * 100
Disk space used by logs .126.96.36.199.188.8.131.52.184.108.40.206 / .220.127.116.11.18.104.22.168.22.214.171.124 * 100
Node status: .126.96.36.199.4.1.7188.8.131.52.2.1.4.X
Solved! Go to Solution.
10-08-2012 12:05 PM
How about diagnoseSystem() described on page 444 of the control api documentation:
10-08-2012 12:14 PM
I've been looking at using that but I was having issue pulling this into our monitoring system and generating the logic that determines the difference between red, yellow, and green. Do you have the logic that is used on the main GUI to display the red, yellow, or green status?