Forum Discussion
Sarah_Terry
Product Manager
8 years ago
Hi Anil,
There's a default threshold on the UnHealthyHostCount datapoint for Load Balancers (a value > 0 triggers a warning), but you can also set a threshold of < 1 on the HealthyHostCount datapoint to alert when there are no healthy instances. As for the 45-minute condition, there are a couple of ways to do this:
- 1. Adjust the Alert Trigger Interval in the Load Balancer DataSource definition: the Alert Trigger Interval in the datapoint definition controls how many consecutive polling intervals the threshold condition must be true before an alert is triggered. This means that with a threshold of < 1 for HealthyHostCount, an Alert Trigger Interval of 30 (or 60), and a collection interval of 1 minute, the load balancer would have to report fewer than one healthy host for 30 minutes (or 60 minutes) before an alert triggers. If the collection interval is 2 minutes, you can get closer to the 45-minute mark by setting the trigger interval to 20 or 24 (roughly 40 or 48 minutes).
- 2. Make sure the alert has been active for 45 minutes before routing it to you: you could set up an escalation chain where stage 1 is empty or a non-disruptive destination like a chat tool, stage 2 emails or texts you, and the escalation interval is 45 minutes. With that setup, when HealthyHostCount drops below 1, a warning alert is triggered and sent to stage 1. After 45 minutes, if the condition is still true (HealthyHostCount is still less than 1), the alert is sent to you.
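To make the timing in both options concrete, here is a rough Python sketch of the arithmetic. It is not LogicMonitor code: the function names and the simple escalation model are assumptions made up for illustration, and the delay is approximated as collection interval times trigger interval, as in the examples above.

```python
# Illustrative model only; these helpers are hypothetical and not part of any LogicMonitor API.

def minutes_until_alert(collection_interval_min: int, alert_trigger_interval: int) -> int:
    """Approximate minutes the threshold must stay breached before an alert triggers."""
    return collection_interval_min * alert_trigger_interval


def should_alert(healthy_host_counts: list[int], alert_trigger_interval: int) -> bool:
    """True if the most recent `alert_trigger_interval` polls all saw HealthyHostCount < 1."""
    if len(healthy_host_counts) < alert_trigger_interval:
        return False
    recent = healthy_host_counts[-alert_trigger_interval:]
    return all(count < 1 for count in recent)


def escalation_stage(minutes_active: int, escalation_interval_min: int, stage_count: int) -> int:
    """1-based stage reached by an alert that is still active (assumed simple model)."""
    return min(minutes_active // escalation_interval_min, stage_count - 1) + 1


# Option 1: trigger interval values mentioned in the post.
print(minutes_until_alert(1, 30))   # 1-minute collection, trigger interval 30
print(minutes_until_alert(2, 20))   # 2-minute collection, trigger interval 20
print(minutes_until_alert(2, 24))   # 2-minute collection, trigger interval 24

# Option 2: two-stage chain with a 45-minute escalation interval.
print(escalation_stage(10, 45, 2))  # still at stage 1 (chat tool or empty)
print(escalation_stage(45, 45, 2))  # stage 2 (email or text)
```

With option 1 the alert itself is delayed, so nothing shows up until the condition has held for the whole window. With option 2 the alert exists from the first breach and only the notification to you is delayed.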
Thanks,
Sarah