Scripted Alert Thresholds
It should be possible to groovyscript Alert Thresholds, based on (for example) ILPs and hostProperties. I need to modify the SNMP_Network_Interfaces to vary the InDiscardPercent threshold depending on whether this is a radioMAC interface type and whether it is a given customer. Something along the lines of: def isRadio = instanceProps.get('auto.interface.type') == 'radioMAC'; def customerCode = hostProps.get('customer.code'); if(isRadio && customerCode == 'ACME') { // No threshold return ''; } // The default return '> 10';Solved224Views17likes26Comments6 polls, 5 poll cycles, or 12 minutes
Alright, take a look at this. The poll rate for this DS is 2 minutes. What is the trigger window? Assume all the criteria is met starting a 1:59PM and 30 seconds (between polls). When will the alert be triggered? It’s actually 10 minutes (2:10 PM, +/- a couple seconds for collector task queue delay and assuming original scheduling at the top of the hour). Do you know why?Why does it say 12 minutes? Why not 10? Is this behavior different than regular alert trigger intervals?Solved61Views2likes3CommentsInstance Management - Alerts
Hi, Curious to know how people have managed setting alert thresholds for datasources where new instances will not be recognised. The notice I get is: “These changes will only apply to existing instances of this datasource on this device. They will not apply to instances added later” This is a challenge for large environments where it’s easy to forget this fact, and potentially miss monitoring new instances altogether. Keen to get feedback on any strategies used to address this issue.Solved81Views1like4CommentsAdd alert timeframe to include days of the week
A feature enhancement that enables alerts to be limited to certain days of the week as well as hours/mins would be very beneficial as there are often occasions when an alert is needed in the working week but not at the weekend. An example is NetApp snapmirror lagtime. Mon-Sat these are set to replicate but not on a Sunday. We look for 24 hour lag most of the time to see an issue but on a Monday this would be 48 hours (as there would have been no snapmirror since the Sat). I appreciate I can create ways to manage alerts using time based escalations however there is no way to affect the alerts view on the dashboard with this approach. Hopefully something that other might also want which can be added in the future?8Views1like3CommentsDetailed Alert Thresholds
I get asked all the time to detail what alert thresholds are set for a group of devices. The current standard report doesn't provide the data to a low enough level. At present the report shows what could be set if the DS is assigned and what is set different to global. What is required is a report that details all the alert thresholds that are actually set for each applied DS for each device in turn. This will allow the teams to easily see and set thresholds for each device. If there is a way of doing this already, please point me in the right direction.4Views1like1Comment