Download PDF
Download page Cluster Metrics.
Cluster Metrics
The Cluster Agent Dashboard metrics are derived from the Kubernetes API and report information about the Clusters and Pods.
The Cluster Agent reports events on these Kubernetes and Hardware resources on any defined set of namespaces. We monitor cluster health and Kubernetes objects:
- Cluster Summary Metrics
- CPU
- DaemonSets
- Deployments
- Endpoints
- Jobs
- Memory
- Nodes
- Pods
- PVC
- ReplicaSets
- Services
- Storage Quota
Cluster Summary Metrics
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Error events count | The number of error events. | Dashboard > Errors | Hardware Resources|Cluster|Error events count |
Evicted pods count | The number of evicted pods. | Pods > Evicted | Hardware Resources|Cluster|Evicted pods count |
Eviction threats count | The number of events that represent pod evictions. | Dashboard > Errors | Hardware Resources|Cluster|Eviction threats count |
Image pull errors | The number of image pull errors. | Dashboard > Issues > Image Issues | Hardware Resources|Cluster|Image pull errors |
Image pulls | The number of image pulls. | Dashboard > Issues > Image Issues | Hardware Resources|Cluster|Image pulls |
Info events count | The number of informational events. | Dashboard > Errors | Hardware Resources|Cluster|Info events count |
Pod errors | The number of errors related to pods. | Dashboard > Issues > Pod Issues | Hardware Resources|Cluster|Pod errors |
Pod Kills | The number of pods that were killed. | Inventory > Pods > Pod Kills | Hardware Resources|Cluster|Pod Kills |
Pod restarts | The number of times the pods restarted. | Dashboard > Issues > Pod Issues | Hardware Resources|Cluster|Pod restarts |
Pods Scaledowns | You can scale down your deployments and replica sets. The count of scaledowns. | Inventory > Pods > Scaledowns | Hardware Resources|Cluster|Pods Scaledowns |
Pods count | Total count of pods. | Inventory > Pods > Phases > Normal | Hardware Resources|Cluster|Pods count |
Pods failed | The number of failed pods. | Pods > Failed | Hardware Resources|Cluster|Pods failed |
Pods pending | The number of pods in a pending state. Pending status normally indicates an issue. For more information see the Kubernetes documentation. | Pods > Pending | Hardware Resources|Cluster|Pods pending |
Pods running | The number of pods in a running state. | Pods > Running | Hardware Resources|Cluster|Pods running |
Pods succeeded | The number of pods in Succeeded phase. | Dashboard > Pods By Phase | Hardware Resources|Cluster|Pods succeeded |
Pods unknown | The number of pods in Unknown state. | Dashboard > Pods By Phase | Hardware Resources|Cluster|Pods unknown |
Pods with Missing Dependencies - Config Maps and Secrets | If a pod is dependent on any Config Maps & Secrets, then those dependencies are missing. | Inventory > Pods > Missing Dependencies - Config Maps and Secrets | Hardware Resources|Cluster|Pods With Missing Dependencies - Config Maps And Secrets (Pod Metrics for Inventory Tab) |
Pods with Missing Dependencies - Services | If a pod is dependent on any Services, then those dependencies are missing. | Inventory > Pods > Missing Dependencies - Services | Hardware Resources|Cluster|Pods With Missing Dependencies (Pod Metrics for Inventory Tab) |
Pods with No Limits | The number of pods with no limits (on CPU/memory) set. If you have specified limits on any pod that you are starting, this metric will tell you how many pods don't have a limit defined. Displayed in the Inventory Tab, under Pod Metrics. | Inventory > Pods > No Limits | Hardware Resources|Cluster|Pods With No Limits |
Pods With No Liveness Probe | The number of pods with no liveness probe. If you've configured a probe in Kubernetes to monitor liveness, the values will be displayed in the Inventory Tab, under Pod Metrics. | Inventory > Pods > No Probes -Liveness | Hardware Resources|Cluster|Pods With No Liveness Probe |
Pods With No Readiness Probe | The number of pods with no readiness probe. If you've configured a probe in Kubernetes to monitor readiness the values will be displayed in the Inventory Tab, under Pod Metrics. | Inventory > Pods > No Probes -Readiness | Hardware Resources|Cluster|Pods With No Readiness Probe |
Privileged Pods | The number of privileged pods that run with root access. Displayed in the Inventory Tab, under Pod Metrics. | Inventory > Pods > Privileged | Hardware Resources|Cluster|Privileged Pods |
Storage errors | The overall number of errors related to storage for the cluster. | Inventory > Pod Metrics | Hardware Resources|Cluster|Storage errors |
Storage quota violations | The number of storage quota violations. If someone exceeds that quota. | Inventory > Pod Metrics | Hardware Resources|Cluster|Storage quota violations |
CPU
CPU Capacity
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Total (MilliCores) | This metric depicts the total CPU capacity for the cluster in Millicores. | Cluster Capacity > CPU | Hardware Resources|Cluster|CPU|Capacity|Total (MilliCores) |
Used (MilliCores) | This metric depicts the CPU capacity already used by the cluster in Millicores. | Cluster Capacity > CPU | Hardware Resources|Cluster|CPU|Capacity|Used (MilliCores) |
CPU Quota
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Limit Used (%) | The percentage of CPU limit quota used. | Dashboard > Quotas > CPU Limit | Hardware Resources|Cluster|CPU|Quota|Limit Used (%) |
Limit Used (MilliCores) | The Millicores value for CPU limit quota used. | Dashboard > Quotas > CPU Limit | Hardware Resources|Cluster|CPU|Quota|Limit Used (MilliCores) |
Request Used (%) | The percentage of CPU request quota used. | Dashboard > Quotas > CPU Request | Hardware Resources|Cluster|CPU|Quota|Request Used (%) |
Request Used (MilliCores) | The Millicores value for CPU request quota used. | Dashboard > Quotas > CPU Request | Hardware Resources|Cluster|CPU|Quota|Request Used (Millicores) |
CPU Utilization
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Limit (MilliCores) | The limit of CPU which can be used by the pods. Only the pods belonging to monitored namespaces are considered to calculate this metric. | Dashboard > Utilization > CPU | Hardware Resources|Cluster|CPU|Utilization|Limit (MilliCores) |
Request (MilliCores) | The Millicore value of CPU which all the pods in monitored namespaces have requested for. | Dashboard > Utilization > CPU | Hardware Resources|Cluster|CPU|Utilization|Request (MilliCores) |
Used (MilliCores) | The actual CPU which the pods from monitored namespaces are currently using. | Dashboard > Utilization > CPU | Hardware Resources|Cluster|CPU|Utilization|Used (MilliCores) |
DaemonSets
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The number of daemon sets that exist. | Inventory > Objects > DaemonSets > (Count) | HardwareResources|Cluster|DaemonSets|Count |
Nodes Available | The number of nodes that are running and available on the cluster. | Inventory > Objects > DaemonSets > Available | HardwareResources|Cluster|DaemonSets|Nodes Available |
Nodes MissScheduled | The number of nodes that are running, but shouldn't be running. | Inventory > Objects > DaemonSets > MissScheduled | HardwareResources|Cluster|DaemonSets|Nodes MissScheduled |
Nodes Unavailable | The number of nodes that should be running, but are not running. | Inventory > Objects > DaemonSets > Unavailable | HardwareResources|Cluster|DaemonSets|Nodes Unavailable |
Deployments
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The number of deployments that exist in the cluster. | Inventory > Objects > Deployments > (Count) | HardwareResources|Cluster|Deployments|Count |
Replicas | The number of pod replicas in the cluster that are not in a terminated state. | Inventory > Objects > Deployments > Available | HardwareResources|Cluster|Deployments|Replicas |
Replicas Unavailable | The total number of unavailable pod replicas across all deployments in the cluster. | Inventory > Objects > Deployments > Unavailable | HardwareResources|Cluster|Deployments|ReplicasUnavailable |
Endpoints
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The number of endpoints in the cluster. | Inventory > Services > Endpoints > Count | HardwareResources|Cluster|Endpoints|Count |
Not Ready Address | The total number of not ready addresses for all the endpoints in the cluster. | Inventory > Services > Endpoints without ready IP | HardwareResources|Cluster|Endpoints|Not Ready Address |
Orphans | The total number of endpoints in the cluster which do not have any ready nor any not ready addresses. | Inventory > Services > Orphan Endpoints with no IP | HardwareResources|Cluster|Endpoints|Orphans |
Ready Address | The total number of ready addresses for all the endpoints in the cluster. | Inventory > Services > Endpoints | HardwareResources|Cluster|Endpoints|Ready Address |
Jobs
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The total number of jobs in the cluster. | Inventory > Objects > Jobs > (Count) | Hardware Resources|Cluster|Jobs|Count |
Pods Active | The total number of active pods for all the jobs in the cluster. | Inventory > Objects > Jobs > Active | Hardware Resources|Cluster|Jobs|Pods Active |
Pods Failed | The total number of pods which reached phase Failed for all the jobs in the cluster. | Inventory > Objects > Jobs > Failed | Hardware Resources|Cluster|Jobs|Pods Failed |
Pods Succeeded | The total number of pods which reached phase Succeeded for all the jobs in the cluster. | Inventory > Objects > Jobs > Succeeded | Hardware Resources|Cluster|Jobs|Pods Succeeded |
Memory
Memory Capacity
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Total (MB) | This metric depicts the total Memory capacity for the cluster in MBs. | Dashboard > Cluster Capacity > Memory | Hardware Resources|Cluster|Memory|Capacity|Total (MB) |
Used (MB) | This metric depicts the Memory capacity already used by the cluster in MBs. | Dashboard > Cluster Capacity > Memory | Hardware Resources|Cluster|Memory|Capacity|Used (MB) |
Memory Quota
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Limit Used (%) | The percentage of Memory limit quota used. | Dashboard > Quotas > Memory Limit | Hardware Resources|Cluster|Memory|Quota|Limit Used (%) |
Limit Used (MB) | The MB value for Memory limit quota used. | Dashboard > Quotas > Memory Limit | Hardware Resources|Cluster|Memory|Quota|Limit Used (MB) |
Request Used (%) | The percentage of Memory request quota used. | Dashboard > Quotas > Memory Request | Hardware Resources|Cluster|Memory|Quota|Request Used (%) |
Request Used (MB) | The MB value for Memory request quota used. | Dashboard > Quotas > Memory Request | Hardware Resources|Cluster|Memory|Quota|Request Used (MB) |
Memory Utilization
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Limit (MB) | The limit of Memory which can be used by the pods. Only the pods belonging to monitored namespaces are considered to calculate this metric. | Dashboard > Utilization > Memory | Hardware Resources|Cluster|Memory|Utilization|Limit (MB) |
Request (MB) | The MB value of Memory which all the pods in monitored namespaces have requested for. | Dashboard > Utilization > Memory | Hardware Resources|Cluster|Memory|Utilization|Request (MB) |
Used (MB) | The actual Memory which the pods from monitored namespaces are currently using. | Dashboard > Utilization > Memory | Hardware Resources|Cluster|Memory|Utilization|Used (MB) |
Nodes
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Master Count | The number of master nodes in the cluster. | Inventory > Masters | Hardware Resources|Cluster|Nodes|Master Count |
Worker Count | The number of worker nodes in the cluster. | Inventory > Workers | Hardware Resources|Cluster|Nodes|Worker Count |
Memory Pressure Count | The number of nodes that are under memory pressure in the cluster. | Inventory > Memory Pressure | Hardware Resources|Cluster|Nodes|Memory Pressure Count |
Disk Pressure Count | The number of nodes that are under disk pressure in the cluster. | Inventory > Disk Pressure | Hardware Resources|Cluster|Nodes|Disk Pressure Count |
Pods
Pods Capacity
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Total Count | The total number of pods that a cluster can support. | Pods > Total Count | Hardware Resources|Cluster|Pods|Capacity|Total Count |
Used Count | The number of pods already created in the cluster. | Pods > Count | Hardware Resources|Cluster|Pods|Capacity|Used Count |
PVC
PVC Quota
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Used | PVC quota already being used in the cluster. (count) | Dashboard > Quotas > PVC | Hardware Resources|Cluster|PVC|Quota|Used |
Used % | Percentage of PVC quota already being used in the cluster. | Dashboard > Quotas > PVC | Hardware Resources|Cluster|PVC|Quota|Used (%) |
PVC Utilization
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Capacity (MB) | The total PVC available for the pods in the monitored namespaces. | Dashboard > Utilization > PVCs | Hardware Resources|Cluster|PVC|Utilization|Capacity (MB) |
Request (MB) | The value for PVC requested by pods in monitored namespaces. | Dashboard > Utilization > PVCs | Hardware Resources|Cluster|PVC|Utilization|Request (MB) |
ReplicaSets
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The number of replica set resources in the cluster. | Inventory > Objects > ReplicaSets > Count | Hardware Resources|Cluster|Count |
Replicas | The total number of replicas for all the replica sets in the cluster. | Inventory > Objects > ReplicaSets > Count | Hardware Resources|Cluster|ReplicaSets|Replicas |
Replicas Available | The total number of available replicas for all the replica sets in the cluster. | Inventory > Objects > ReplicaSets > Available | Hardware Resources|Cluster|ReplicaSets|Replicas Available |
Replicas Unavailable | The total number of unavailable replicas for all the replica sets in the cluster. | Inventory > Objects > ReplicaSets > Unavailable | Hardware Resources|Cluster|ReplicaSets|Replicas Unavailable |
Services
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Count | The total number of Kubernetes Services running in the cluster. | Inventory > Services > Services | Hardware Resources|Cluster|Services|Count |
Storage Quota
Metric Name | Description | UI Location | Metric Path |
---|---|---|---|
Used (MB) | The storage quota used by the cluster in MB. | Dashboard > Quotas > Storage | Hardware Resources|Cluster|Storage|Quota|Used (MB) |
Used (%) | The percentage of storage quota used by the cluster. | Dashboard > Quotas > Storage | Hardware Resources|Cluster|Storage|Quota|Used (%) |