Sum across DataSources and dynamic alert levels
I would like to monitor our VMware environment to determine whether the memory and/or CPU assigned to our VMs would exceed the capacity we would have left to run all VMs if we lost an ESX host server.
Something like:
We have two dynamic groups created:
ESXHost = system.version =~ "VMWare"
ESXVMs = system.model == "VMware Virtual Platform"
Datapoints needed:
ESXHOSTCPU: SUM of CPU MHz for all ESX servers.
ESXHOSTMEM: SUM of MEMORY installed for all ESX servers.
ESXVMCPU: SUM of CPU MHz assigned for all VMs.
ESXVMMEM: SUM of MEMORY assigned for all VMs.
Calculations:
(ESXVMCPU/ESXHOSTCPU)*100
(ESXVMMEM/ESXHOSTMEM)*100
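To make the two calculations concrete, here is a minimal Python sketch of the sum-then-divide step. It is only an illustration of the arithmetic, not something the product does today: the function name allocation_percent and all of the sample figures are made up, and in practice the lists would come from the datapoints of the devices in the ESXHost and ESXVMs groups.

```python
def allocation_percent(vm_values, host_values):
    """Return the summed VM allocation as a percentage of summed host capacity."""
    host_total = sum(host_values)
    if host_total <= 0:
        raise ValueError("total host capacity must be positive")
    return sum(vm_values) / host_total * 100


# Hypothetical sample data: three ESX hosts and four VMs.
host_cpu_mhz = [24000, 24000, 24000]      # summed, this is ESXHOSTCPU
vm_cpu_mhz = [8000, 12000, 10000, 6000]   # summed, this is ESXVMCPU
host_mem_mb = [65536, 65536, 65536]       # summed, this is ESXHOSTMEM
vm_mem_mb = [16384, 32768, 32768, 16384]  # summed, this is ESXVMMEM

cpu_pct = allocation_percent(vm_cpu_mhz, host_cpu_mhz)  # (ESXVMCPU/ESXHOSTCPU)*100
mem_pct = allocation_percent(vm_mem_mb, host_mem_mb)    # (ESXVMMEM/ESXHOSTMEM)*100

print(f"CPU assigned: {cpu_pct:.1f}%  Memory assigned: {mem_pct:.1f}%")
```

With these sample numbers both percentages come out to 50%.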
Alert Conditions
Dynamic alert levels would be great.
The dynamic alert level would be 100 - (100/(total members of group ESXHost)).
In my case I have 3 ESX hosts, so the alert level would be something like > 66 (100 - 100/3 is about 66.7).
If the sum of memory or CPU MHz assigned to all VMs exceeded 66% of the combined memory or CPU MHz, respectively, available across all ESX servers, then we would not be able to keep all VMs running if we lost an ESX host.
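The dynamic alert level described above could be sketched like this in Python. The function names failover_threshold and exceeds_failover_capacity are hypothetical, and the formula implicitly assumes all ESX hosts have roughly equal capacity; with unevenly sized hosts, the threshold would need to be based on losing the largest host instead.

```python
def failover_threshold(host_count):
    """Alert level in percent: 100 - (100 / number of ESX hosts).

    Assumes every host contributes an equal share of the total capacity.
    """
    if host_count < 1:
        raise ValueError("host_count must be at least 1")
    return 100 - 100 / host_count


def exceeds_failover_capacity(allocated_pct, host_count):
    """True if the assigned percentage would not fit after losing one host."""
    return allocated_pct > failover_threshold(host_count)


# The threshold rises automatically as hosts are added to the group.
for hosts in (2, 3, 4):
    print(f"{hosts} hosts -> alert above {failover_threshold(hosts):.1f}%")

# Hypothetical check: 70% assigned on a 3-host cluster would trigger the alert.
print(exceeds_failover_capacity(70, 3))
```

Because the threshold is derived from the group's member count, adding a fourth host would raise the alert level to 75% without anyone editing the alert by hand.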