What do you alert on for CPU on Linux?
Hi,
For Windows, we use the standard CPU Percent to alert when a server is running hot. We do the same for Linux, but we also have those MinLoadPerCore alerts that we get all the time on various machines.
When we get the MinLoad alerts, we look at CPU and usually find that it's not running super high so we ignore them. On some Noisy machine we just keep upping the threshold from 1 to 1.2 to 1.5 etc until we stop getting alerts. That seems kind of pointless and I'm leaning towards just turning off alerting on the MinLoad datapoints completely.
So my main question is, what do you all alert on? Do you find the MinLoadPerCore alerts to be valuable? When you get one, do you take steps to up the CPU count on those machines even if the CPU usage isn't super high?
Just looking to see what everyone else does.
Thanks