Custom Controller Utilization Alert Thresholds
Overview
An Avi Controller cluster is continuously collecting CPU, disk and memory utilization metrics from the cluster nodes themselves. When the usage threshold is exceeded a system alert is raised to notify the admin. By default this threshold has been preconfigured to be 85% for CPU, disk and memory.
In some deployments this predefined threshold may not be conservative enough and a lower value is desired. The following will provide an example for how to modify these thresholds to meet your deployment’s requirements.
Threshold Configuration Options
Custom thresholds can independently be defined for:
CONTROLLER_CPU_THRESHOLD
CONTROLLER_MEM_THRESHOLD
CONTROLLER_DISK_THRESHOLD
When defining the configuration there are two threshold options to be aware of:
-
watermark_thresholds: Threshold value for which event is raised. There can be multiple thresholds defined. Health score degrades when the the target is higher than this threshold.
-
reset_thresholds: The value used to reset the event state machine.
Configure Controller Thresholds:
The following configuration provides an example of overwriting the default values. This example will configure a watermark_threshold
of 75 and a reset_threshold
of 60 for CPU, disk and memory. With this configuration, if resource utilization of any of these three exceed 75%, an alert will be raised for that resource. Once utilization drops below 60% the alert will be reset.
[admin:controller]: > configure systemconfiguration
[admin:controller]: systemconfiguration> controller_analytics_policy
[admin:controller]: systemconfiguration:controller_analytics_policy> metrics_event_thresholds
[admin:controller]: systemconfiguration:controller_analytics_policy> metrics_event_thresholds metrics_event_threshold_type controller_cpu_threshold
New object being created
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> reset_threshold 60 watermark_thresholds 75
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> save
[admin:controller]: systemconfiguration:controller_analytics_policy> metrics_event_thresholds metrics_event_threshold_type controller_mem_threshold
New object being created
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> reset_threshold 60 watermark_thresholds 75
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> save
[admin:controller]: systemconfiguration:controller_analytics_policy> metrics_event_thresholds metrics_event_threshold_type controller_disk_threshold
New object being created
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> reset_threshold 60 watermark_thresholds 75
[admin:controller]: systemconfiguration:controller_analytics_policy:metrics_event_thresholds> save
[admin:controller]: systemconfiguration:controller_analytics_policy> save
[admin:controller]: systemconfiguration> save
+----------------------------------+------------------------------------+
| Field | Value |
+----------------------------------+------------------------------------+
| controller_analytics_policy | |
| metrics_event_thresholds[1] | |
| reset_threshold | 60.0 |
| watermark_thresholds[1] | 75 |
| metrics_event_threshold_type | CONTROLLER_CPU_THRESHOLD |
| metrics_event_thresholds[2] | |
| reset_threshold | 60.0 |
| watermark_thresholds[1] | 75 |
| metrics_event_threshold_type | CONTROLLER_MEM_THRESHOLD |
| metrics_event_thresholds[3] | |
| reset_threshold | 60.0 |
| watermark_thresholds[1] | 75 |
| metrics_event_threshold_type | CONTROLLER_DISK_THRESHOLD |
+----------------------------------+------------------------------------+
Document Revision History
Date | Change Summary |
---|---|
December 20, 2021 | New KB for 21.1.3 |