In Abiquo 4.6, the monitoring system has been improved to better support large numbers of VMs. The improvements are based on the results of a simulation in which a script deployed VMs and modified their metrics to activate and deactivate alarms and alerts. For the simulation, 5000 VMs were deployed on 5 hypervisor emulators running as Docker containers. During the simulation, 2 metrics of each VM were set to values that forced the activation of alarms and alerts for a 15-minute period, and then returned to default values.
The system quickly detected the activated alarms and alerts and sent the corresponding notifications. The simulation results nevertheless led to the following improvements:
Optimized configuration of KairosDB incoming queue processor
Increased 'batch_size' (requires changing the Cassandra configuration). Size this value according to the expected number of VMs, metrics per VM, and data points per minute
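As a rough aid for this sizing, the expected load can be estimated with simple arithmetic. The following is a minimal sketch, not an Abiquo or KairosDB tool: the function names are hypothetical, the figures for metrics per VM, sampling rate, and batch size are illustrative assumptions, and it assumes that 'batch_size' counts data points per batch.

```python
import math


def datapoints_per_minute(vms: int, metrics_per_vm: int, samples_per_minute: int) -> int:
    """Estimate how many data points the platform ingests each minute."""
    return vms * metrics_per_vm * samples_per_minute


def batches_per_minute(points_per_minute: int, batch_size: int) -> int:
    """Estimate how many batches are needed per minute for a given batch size.

    Assumption: batch_size is expressed in data points per batch.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    return math.ceil(points_per_minute / batch_size)


if __name__ == "__main__":
    # 5000 VMs matches the simulation; the other values are assumed for illustration.
    points = datapoints_per_minute(vms=5000, metrics_per_vm=10, samples_per_minute=1)
    print(points)                                       # 50000
    print(batches_per_minute(points, batch_size=200))   # 250
```

Replace the illustrative values with the figures expected in your environment, then compare the resulting batch count with what the queue processor and Cassandra can handle per minute.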