alerting

5 Topics

Custom Alert Routing Suggestion
In the new UI, I really appreciate being able to set custom Static or Dynamic Thresholds on a DataSource, along with time-based alerting, right on the alert's page. This is useful when dealing with one-off DataSources. What would make me use the feature a lot more is the ability to add custom "Alert Routing" in the same place. An example is SQL Server, an older module that we still use: the resource itself (the box) is managed by my System Admin team, while the SQL Server DataSource is managed by my Database team. It's just a suggestion. I know my org is very siloed, but this would make my life a lot easier.
JosiahBenoit
10 months ago Place Product Discussions
61 Views
3 likes
3 Comments
Can someone explain the CPU 5 minute load average thing to me?
So, we have a server currently alerting us because the 5MinLoadPerCore field is > 1. I'm trying to understand why that is. I found this page that says if that number is > 1, it means there are things queued up waiting for the CPU and there's a backlog. https://www.logicmonitor.com/blog/what-the-heck-is-cpu-load-on-a-linux-machine-and-why-do-i-care However, the server in question has its CPUs currently running at around 60%. I would think that if it were backed up, it should be cranking at 99% trying to catch up. I would also think the 5MinLoad alert and a CPU usage percentage alert would come as a pair, but they don't. Just trying to figure out if there's anything that can be done when we get the 5Min alerts, or if they're more informational and can be ignored. They only come in as a Warning anyway, so if they're just informational, then they're just noise, and maybe we'll turn them off. Just looking for other opinions. ;) Thanks.
Kelemvor
2 years ago Place Product Discussions
1.1K Views
2 likes
3 Comments
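For the load-average question above, it may help to see how a per-core load figure relates to CPU utilization. On Linux, the load average counts tasks that are runnable and also tasks in uninterruptible sleep (typically waiting on disk I/O), so a per-core load above 1 with CPUs at about 60% can point to work queued behind something other than the CPU. The sketch below is only an illustration: the function names and the 90% "busy" cutoff are assumptions, not LogicMonitor's datapoint logic.

```python
import os


def load_per_core(load_avg: float, cores: int) -> float:
    """Normalize a load average by the number of logical CPUs."""
    if cores <= 0:
        raise ValueError("cores must be positive")
    return load_avg / cores


def describe(load_avg_5m: float, cores: int, cpu_busy_pct: float) -> str:
    """Rough triage of a 5-minute load reading (cutoffs are assumptions)."""
    per_core = load_per_core(load_avg_5m, cores)
    if per_core <= 1.0:
        return "ok"
    if cpu_busy_pct < 90.0:
        # Tasks are queued, but the CPUs are not pegged: on Linux this
        # often means tasks blocked in uninterruptible (I/O) wait.
        return "queued-not-cpu-bound"
    return "cpu-saturated"


def current_load_per_core() -> float:
    """Read the live 5-minute value (os.getloadavg is Unix-only)."""
    _one, five, _fifteen = os.getloadavg()
    return load_per_core(five, os.cpu_count() or 1)
```

For example, `describe(6.0, 4, 60.0)` classifies a 4-core box with a load of 6 and 60% CPU as queued but not CPU-bound, which is a hint to look at disk or other I/O wait before dismissing the alert as noise.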
How do you handle Disk Space alerting?
Hi, The LM standard is to do disk space alerting based on the percentage of space used, e.g. 90% warning, 95% error, 98% critical (or whatever). This is great for machines with average-size drives, but is completely useless for machines with giant drives. We have some machines with drives that are multiple Terabytes in size. Getting an alert when a 2TB drive is 90% full doesn't help, because it still has 200 Gigs free. For all our Windows servers, we've added additional alerting based on a hard-coded 10 Gigs free on the C drive, because Windows Updates generally have issues if you have less than that. This has helped a bunch for smaller drives, where 10 Gigs free isn't small enough to hit the percent-based alerting because maybe it's only a 60 Gig C drive. I'm just wondering what everyone else does for these types of alerts. Do you use the percentage-based alerts for most machines but then create new ones, or change the percentage for large or small servers? Do you change everything to a hard size limit? Some other combination? Just looking for ideas so we can try to reduce the unnecessary alerts for servers with huge drives and get alerts for ones with tiny drives. Thanks.
Kelemvor
2 years ago Place Product Discussions
140 Views
3 likes
1 Comment
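One way to picture the "some other combination" option from the disk space post is a hybrid rule: a percentage threshold only fires when free space is also below an absolute ceiling, and a hard floor catches small drives. This is a minimal sketch, not a LogicMonitor feature. The 90/95/98% levels and the 10 GB floor come from the post; the per-level free-space ceilings (100/50/25 GB) and all names are assumptions chosen for illustration.

```python
from typing import Optional

# (severity, minimum used %, maximum free GB); ceilings are assumed values
PCT_LEVELS = [
    ("critical", 98.0, 25.0),
    ("error", 95.0, 50.0),
    ("warning", 90.0, 100.0),
]
HARD_FLOOR_GB = 10.0  # absolute floor mentioned in the post


def disk_severity(total_gb: float, free_gb: float) -> Optional[str]:
    """Return an alert severity, or None if the drive looks healthy."""
    if total_gb <= 0:
        raise ValueError("total_gb must be positive")
    used_pct = 100.0 * (total_gb - free_gb) / total_gb
    # Percentage rules only apply when free space is also genuinely low,
    # which suppresses alerts on very large drives.
    for name, min_used_pct, max_free_gb in PCT_LEVELS:
        if used_pct >= min_used_pct and free_gb < max_free_gb:
            return name
    # The absolute floor catches small drives that never reach 90% used.
    if free_gb < HARD_FLOOR_GB:
        return "warning"
    return None
```

With these assumed values, a 2000 GB drive with 200 GB free raises nothing, while a 60 GB drive with 9 GB free raises a warning through the hard floor.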
Netflow Alerting Rules
LogicMonitor NetFlow is not limited to visualising NetFlow data on the LM platform. Its most recent improvement is Traffic Alerting Rules: you can set up traffic alert rules for NetFlow resources to get alerts when a resource's traffic hits a specific threshold, drops off for a specified length of time, etc. Traffic alert rules can be created at two levels: the group level and the resource level. Don't miss out on the advantages of this feature; refer to the link below for more details. https://www.logicmonitor.com/support/traffic-alert-rule
mr_ravimishra
3 years ago Place Product Discussions
129 Views
18 likes
0 Comments
Threshold Duration
Hello, I was wondering whether it is possible to impose a duration constraint on a threshold in LogicMonitor. I see where you can enable dynamic alerts, but I was not sure whether they look back over the duration of the alert condition rather than at a single floating data point that they attempt to normalize. Thanks in advance.
Solved
starboy9
6 years ago Place Product Discussions
55 Views
0 likes
1 Comment
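The duration constraint asked about in the last topic is commonly modelled as "the value must breach the threshold for N consecutive polls before an alert fires". The sketch below shows that general idea only; the class and parameter names are hypothetical, and it is not a description of how LogicMonitor implements it.

```python
class DurationThreshold:
    """Fire only after a value exceeds a threshold for N polls in a row."""

    def __init__(self, threshold: float, required_polls: int) -> None:
        if required_polls < 1:
            raise ValueError("required_polls must be at least 1")
        self.threshold = threshold
        self.required_polls = required_polls
        self._streak = 0  # consecutive polls above the threshold so far

    def observe(self, value: float) -> bool:
        """Record one poll; return True while the alert condition holds."""
        if value > self.threshold:
            self._streak += 1
        else:
            # A single in-range poll resets the count.
            self._streak = 0
        return self._streak >= self.required_polls
```

With `DurationThreshold(90, 3)`, a short spike of two polls above 90 never alerts; the alert appears on the third consecutive breaching poll and clears as soon as one poll falls back to 90 or below.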