Customizing HostStatus datasource to alert on "No Data" ???
Can anyone share some feedback on customizing default global definition behavior for datasource HostStatus (HR6FND)?
More specifically, I'm looking for some ideas about changing the behavior of either the heartbeat or idleInterval datapoints: by default neither triggers an alert if there is No Data. At some point in the past I thought about changing this so that an alert would be triggered, but then I seem to recall that there were some reasons where I thought this might produce unintended consequences and so I needed to consider more carefully and thoroughly whether this would be a good idea. Unfortunately, I never returned to that thought exercise and I also didn't keep good notes because I don't remember what those reasons were.
Fast-forward to last week and we had a situation where a monitored resource suffered failure that we would have caught much sooner if we were alerting (to a chain with notification delivery) on "No Data" The resource did not stop responding to Ping, but nearly all of its snmp-based datasources/datapoints started returning No Data; without the alert and notification we did not realize that the resource had stopped providing service.
So I'm thinking about this again now. Any/all ideas are welcome.
- Anonymous3 years ago
The HostStatus datsource should always have data, even if the device is not responding. We have a >300 threshold on the idleInterval datapoint.
I've thought about making an SNMP troubleshooter, but that would be so similar to the SNMP_Host_Uptime DS that i then thought about just putting a no data alert on the Uptime datapoint there. I never actually did it, but I may do in the future. Should be the canary for SNMP monitoring.