The Cluster Agent Dashboard metrics are derived from the Kubernetes API and report information about clusters and pods.

The Cluster Agent reports events for these Kubernetes and hardware resources in any defined set of namespaces. It monitors cluster health and the following Kubernetes objects:

Cluster Summary Metrics

Metric Name: Error events count
Description: The number of error events.
UI Location: Dashboard > Errors
Metric Path: Hardware Resources|Cluster|Error events count

Metric Name: Evicted pods count
Description: The number of evicted pods.
UI Location: Pods > Evicted
Metric Path: Hardware Resources|Cluster|Evicted pods count

Metric Name: Eviction threats count
Description: The number of events that represent pod evictions.
UI Location: Dashboard > Errors
Metric Path: Hardware Resources|Cluster|Eviction threats count

Metric Name: Image pull errors
Description: The number of image pull errors.
UI Location: Dashboard > Issues > Image Issues
Metric Path: Hardware Resources|Cluster|Image pull errors

Metric Name: Image pulls
Description: The number of image pulls.
UI Location: Dashboard > Issues > Image Issues
Metric Path: Hardware Resources|Cluster|Image pulls

Metric Name: Info events count
Description: The number of informational events.
UI Location: Dashboard > Errors
Metric Path: Hardware Resources|Cluster|Info events count

Metric Name: Pod errors
Description: The number of errors related to pods.
UI Location: Dashboard > Issues > Pod Issues
Metric Path: Hardware Resources|Cluster|Pod errors

Metric Name: Pod Kills
Description: The number of pods that were killed.
UI Location: Inventory > Pods > Pod Kills
Metric Path: Hardware Resources|Cluster|Pod Kills

Metric Name: Pod restarts
Description: The number of times the pods restarted.
UI Location: Dashboard > Issues > Pod Issues
Metric Path: Hardware Resources|Cluster|Pod restarts

Metric Name: Pods Scaledowns
Description: The number of times deployments and replica sets were scaled down.
UI Location: Inventory > Pods > Scaledowns
Metric Path: Hardware Resources|Cluster|Pods Scaledowns

Metric Name: Pods count
Description: The total number of pods.
UI Location: Inventory > Pods > Phases > Normal
Metric Path: Hardware Resources|Cluster|Pods count

Metric Name: Pods failed
Description: The number of failed pods.
UI Location: Pods > Failed
Metric Path: Hardware Resources|Cluster|Pods failed

Metric Name: Pods pending
Description: The number of pods in a pending state. A pod that remains pending usually indicates an issue. For more information, see the Kubernetes documentation.
UI Location: Pods > Pending
Metric Path: Hardware Resources|Cluster|Pods pending

Metric Name: Pods running
Description: The number of pods in a running state.
UI Location: Pods > Running
Metric Path: Hardware Resources|Cluster|Pods running

Metric Name: Pods succeeded
Description: The number of pods in the Succeeded phase.
UI Location: Dashboard > Pods By Phase
Metric Path: Hardware Resources|Cluster|Pods succeeded

Metric Name: Pods unknown
Description: The number of pods in the Unknown state.
UI Location: Dashboard > Pods By Phase
Metric Path: Hardware Resources|Cluster|Pods unknown

Metric Name: Pods with Missing Dependencies - Config Maps and Secrets
Description: The number of pods that depend on Config Maps or Secrets that are missing.
UI Location: Inventory > Pods > Missing Dependencies - Config Maps and Secrets
Metric Path: Hardware Resources|Cluster|Pods With Missing Dependencies - Config Maps And Secrets (Pod Metrics for Inventory Tab)

Metric Name: Pods with Missing Dependencies - Services
Description: The number of pods that depend on Services that are missing.
UI Location: Inventory > Pods > Missing Dependencies - Services
Metric Path: Hardware Resources|Cluster|Pods With Missing Dependencies (Pod Metrics for Inventory Tab)

Metric Name: Pods with No Limits
Description: The number of pods with no CPU or memory limits set. If you specify limits on the pods that you start, this metric tells you how many pods still have no limit defined. Displayed in the Inventory tab, under Pod Metrics.
UI Location: Inventory > Pods > No Limits
Metric Path: Hardware Resources|Cluster|Pods With No Limits

Metric Name: Pods With No Liveness Probe
Description: The number of pods with no liveness probe. If you have configured liveness probes in Kubernetes, the values are displayed in the Inventory tab, under Pod Metrics.
UI Location: Inventory > Pods > No Probes - Liveness
Metric Path: Hardware Resources|Cluster|Pods With No Liveness Probe

Metric Name: Pods With No Readiness Probe
Description: The number of pods with no readiness probe. If you have configured readiness probes in Kubernetes, the values are displayed in the Inventory tab, under Pod Metrics.
UI Location: Inventory > Pods > No Probes - Readiness
Metric Path: Hardware Resources|Cluster|Pods With No Readiness Probe

Metric Name: Privileged Pods
Description: The number of privileged pods that run with root access. Displayed in the Inventory tab, under Pod Metrics.
UI Location: Inventory > Pods > Privileged
Metric Path: Hardware Resources|Cluster|Privileged Pods

Metric Name: Storage errors
Description: The overall number of errors related to storage for the cluster.
UI Location: Inventory > Pod Metrics
Metric Path: Hardware Resources|Cluster|Storage errors

Metric Name: Storage quota violations
Description: The number of storage quota violations. A violation occurs when the storage quota is exceeded.
UI Location: Inventory > Pod Metrics
Metric Path: Hardware Resources|Cluster|Storage quota violations
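
The pod-level counts for missing limits, missing probes, and privileged pods can be reasoned about directly from pod specifications. The following is a minimal sketch, not the Cluster Agent's implementation. It assumes pods are available as plain dictionaries shaped like Kubernetes pod manifests, and it assumes a pod is counted when any one of its containers lacks the setting. The helper names (has_no_limits, has_no_probe, is_privileged, summarize) are hypothetical.

```python
from typing import Any

# Assumed shape: a pod manifest loaded as a plain dict (for example from JSON).
Pod = dict[str, Any]


def containers(pod: Pod) -> list[dict[str, Any]]:
    """Return the container specs of a pod, or an empty list."""
    return pod.get("spec", {}).get("containers", [])


def has_no_limits(pod: Pod) -> bool:
    # Assumption: a pod counts if any container lacks a CPU or memory limit.
    for container in containers(pod):
        limits = container.get("resources", {}).get("limits", {})
        if "cpu" not in limits or "memory" not in limits:
            return True
    return False


def has_no_probe(pod: Pod, probe_field: str) -> bool:
    # probe_field is "livenessProbe" or "readinessProbe" (Kubernetes field names).
    return any(probe_field not in container for container in containers(pod))


def is_privileged(pod: Pod) -> bool:
    # A container is privileged when securityContext.privileged is true.
    return any(
        container.get("securityContext", {}).get("privileged", False)
        for container in containers(pod)
    )


def summarize(pods: list[Pod]) -> dict[str, int]:
    """Count pods per category, keyed by the metric names used in the table."""
    return {
        "Pods With No Limits": sum(has_no_limits(p) for p in pods),
        "Pods With No Liveness Probe": sum(has_no_probe(p, "livenessProbe") for p in pods),
        "Pods With No Readiness Probe": sum(has_no_probe(p, "readinessProbe") for p in pods),
        "Privileged Pods": sum(is_privileged(p) for p in pods),
    }
```

Calling summarize on the pods of the monitored namespaces returns one count per category. Whether the agent counts a pod per container or per pod is not stated in this document, so treat the "any container" rule as an illustration only.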


CPU

CPU Capacity

Metric Name: Total (MilliCores)
Description: The total CPU capacity of the cluster, in millicores.
UI Location: Cluster Capacity > CPU
Metric Path: Hardware Resources|Cluster|CPU|Capacity|Total (MilliCores)

Metric Name: Used (MilliCores)
Description: The CPU capacity already used by the cluster, in millicores.
UI Location: Cluster Capacity > CPU
Metric Path: Hardware Resources|Cluster|CPU|Capacity|Used (MilliCores)

CPU Quota

Metric Name: Limit Used (%)
Description: The percentage of CPU limit quota used.
UI Location: Dashboard > Quotas > CPU Limit
Metric Path: Hardware Resources|Cluster|CPU|Quota|Limit Used (%)

Metric Name: Limit Used (MilliCores)
Description: The Millicores value for CPU limit quota used.
UI Location: Dashboard > Quotas > CPU Limit
Metric Path: Hardware Resources|Cluster|CPU|Quota|Limit Used (MilliCores)

Metric Name: Request Used (%)
Description: The percentage of CPU request quota used.
UI Location: Dashboard > Quotas > CPU Request
Metric Path: Hardware Resources|Cluster|CPU|Quota|Request Used (%)

Metric Name: Request Used (MilliCores)
Description: The Millicores value for CPU request quota used.
UI Location: Dashboard > Quotas > CPU Request
Metric Path: Hardware Resources|Cluster|CPU|Quota|Request Used (Millicores)
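
The relationship between the millicore and percentage quota metrics is simple arithmetic. The sketch below assumes the percentage is the used amount divided by the quota's hard limit, which this document does not state explicitly. The function names are hypothetical. In Kubernetes, 1000 millicores equal one CPU core.

```python
def millicores_to_cores(millicores: float) -> float:
    """Convert millicores to CPU cores (1000 millicores = 1 core)."""
    return millicores / 1000


def quota_used_percent(used_millicores: float, hard_millicores: float) -> float:
    """Percentage of a quota consumed (assumed formula: used / hard * 100)."""
    if hard_millicores <= 0:
        raise ValueError("quota hard limit must be positive")
    return 100.0 * used_millicores / hard_millicores
```

For example, 750 millicores used against a 3000 millicore quota gives 25 percent under this assumption.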

CPU Utilization

Metric Name: Limit (MilliCores)
Description: The CPU limit that the pods can use. Only pods in monitored namespaces are included in this metric.
UI Location: Dashboard > Utilization > CPU
Metric Path: Hardware Resources|Cluster|CPU|Utilization|Limit (MilliCores)

Metric Name: Request (MilliCores)
Description: The CPU, in millicores, that all the pods in monitored namespaces have requested.
UI Location: Dashboard > Utilization > CPU
Metric Path: Hardware Resources|Cluster|CPU|Utilization|Request (MilliCores)

Metric Name: Used (MilliCores)
Description: The CPU that the pods in monitored namespaces are currently using.
UI Location: Dashboard > Utilization > CPU
Metric Path: Hardware Resources|Cluster|CPU|Utilization|Used (MilliCores)
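
To illustrate how request and limit totals restricted to monitored namespaces might be derived, here is a rough sketch. It assumes pods are plain dictionaries shaped like Kubernetes pod manifests and handles only the two common CPU quantity forms, a millicore suffix ("500m") and a plain decimal number of cores ("1.5"). The function names are hypothetical and this is not the agent's actual code.

```python
from typing import Any


def cpu_to_millicores(quantity: str) -> int:
    """Convert a CPU quantity such as "500m" or "1.5" to millicores.

    Only the "m" suffix and plain decimal core counts are handled here.
    """
    if quantity.endswith("m"):
        return int(quantity[:-1])
    return round(float(quantity) * 1000)


def total_cpu(pods: list[dict[str, Any]], namespaces: set[str], field: str) -> int:
    """Sum container CPU "requests" or "limits" for pods in the given namespaces."""
    total = 0
    for pod in pods:
        if pod["metadata"]["namespace"] not in namespaces:
            continue  # pods outside monitored namespaces are ignored
        for container in pod["spec"]["containers"]:
            quantity = container.get("resources", {}).get(field, {}).get("cpu")
            if quantity is not None:
                total += cpu_to_millicores(quantity)
    return total
```

Passing "requests" or "limits" as the field argument yields the corresponding total. Containers that declare no value contribute nothing to the sum.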

DaemonSets

Metric Name: Count
Description: The number of daemon sets that exist.
UI Location: Inventory > Objects > DaemonSets > (Count)
Metric Path: Hardware Resources|Cluster|DaemonSets|Count

Metric Name: Nodes Available
Description: The number of nodes on which the daemon pod is running and available.
UI Location: Inventory > Objects > DaemonSets > Available
Metric Path: Hardware Resources|Cluster|DaemonSets|Nodes Available

Metric Name: Nodes MissScheduled
Description: The number of nodes that are running the daemon pod but are not supposed to run it.
UI Location: Inventory > Objects > DaemonSets > MissScheduled
Metric Path: Hardware Resources|Cluster|DaemonSets|Nodes MissScheduled

Metric Name: Nodes Unavailable
Description: The number of nodes that should be running the daemon pod but are not.
UI Location: Inventory > Objects > DaemonSets > Unavailable
Metric Path: Hardware Resources|Cluster|DaemonSets|Nodes Unavailable
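
These counts correspond closely to fields that Kubernetes exposes in each DaemonSet's status (numberAvailable, numberMisscheduled, numberUnavailable). The sketch below assumes the cluster-level metrics are sums of those fields across all daemon sets, which this document does not confirm. The function name daemonset_metrics is hypothetical, and daemon sets are assumed to be plain dictionaries shaped like the API objects.

```python
from typing import Any


def daemonset_metrics(daemonsets: list[dict[str, Any]]) -> dict[str, int]:
    """Aggregate DaemonSet status fields into cluster-level counts."""
    totals = {
        "Count": len(daemonsets),
        "Nodes Available": 0,
        "Nodes MissScheduled": 0,
        "Nodes Unavailable": 0,
    }
    for daemonset in daemonsets:
        status = daemonset.get("status", {})
        # Fields absent from the status are treated as zero.
        totals["Nodes Available"] += status.get("numberAvailable", 0)
        totals["Nodes MissScheduled"] += status.get("numberMisscheduled", 0)
        totals["Nodes Unavailable"] += status.get("numberUnavailable", 0)
    return totals
```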

Deployments

Metric Name: Count
Description: The number of deployments that exist in the cluster.
UI Location: Inventory > Objects > Deployments > (Count)
Metric Path: Hardware Resources|Cluster|Deployments|Count

Metric Name: Replicas
Description: The number of pod replicas in the cluster that are not in a terminated state.
UI Location: Inventory > Objects > Deployments > Available
Metric Path: Hardware Resources|Cluster|Deployments|Replicas

Metric Name: Replicas Unavailable
Description: The total number of unavailable pod replicas across all deployments in the cluster.
UI Location: Inventory > Objects > Deployments > Unavailable
Metric Path: Hardware Resources|Cluster|Deployments|ReplicasUnavailable

Endpoints

Metric Name: Count
Description: The number of endpoints in the cluster.
UI Location: Inventory > Services > Endpoints > Count
Metric Path: Hardware Resources|Cluster|Endpoints|Count

Metric Name: Not Ready Address
Description: The total number of not-ready addresses across all the endpoints in the cluster.
UI Location: Inventory > Services > Endpoints without ready IP
Metric Path: Hardware Resources|Cluster|Endpoints|Not Ready Address

Metric Name: Orphans
Description: The total number of endpoints in the cluster that have neither ready nor not-ready addresses.
UI Location: Inventory > Services > Orphan Endpoints with no IP
Metric Path: Hardware Resources|Cluster|Endpoints|Orphans

Metric Name: Ready Address
Description: The total number of ready addresses across all the endpoints in the cluster.
UI Location: Inventory > Services > Endpoints
Metric Path: Hardware Resources|Cluster|Endpoints|Ready Address
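
The distinction between ready addresses, not-ready addresses, and orphans can be sketched against the Kubernetes Endpoints object, whose subsets carry addresses and notReadyAddresses lists. This is an illustrative sketch under that assumption, not the agent's implementation. The function name endpoint_metrics is hypothetical, and endpoints are assumed to be plain dictionaries shaped like the API objects.

```python
from typing import Any


def endpoint_metrics(endpoints: list[dict[str, Any]]) -> dict[str, int]:
    """Count ready addresses, not-ready addresses, and orphan endpoints."""
    ready = not_ready = orphans = 0
    for endpoint in endpoints:
        endpoint_ready = endpoint_not_ready = 0
        # "subsets" and the address lists may be missing or null in the API object.
        for subset in endpoint.get("subsets") or []:
            endpoint_ready += len(subset.get("addresses") or [])
            endpoint_not_ready += len(subset.get("notReadyAddresses") or [])
        ready += endpoint_ready
        not_ready += endpoint_not_ready
        # An orphan has neither ready nor not-ready addresses.
        if endpoint_ready == 0 and endpoint_not_ready == 0:
            orphans += 1
    return {
        "Count": len(endpoints),
        "Ready Address": ready,
        "Not Ready Address": not_ready,
        "Orphans": orphans,
    }
```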

Jobs

Metric Name: Count
Description: The total number of jobs in the cluster.
UI Location: Inventory > Objects > Jobs > (Count)
Metric Path: Hardware Resources|Cluster|Jobs|Count

Metric Name: Pods Active
Description: The total number of active pods across all the jobs in the cluster.
UI Location: Inventory > Objects > Jobs > Active
Metric Path: Hardware Resources|Cluster|Jobs|Pods Active

Metric Name: Pods Failed
Description: The total number of pods that reached the Failed phase across all the jobs in the cluster.
UI Location: Inventory > Objects > Jobs > Failed
Metric Path: Hardware Resources|Cluster|Jobs|Pods Failed

Metric Name: Pods Succeeded
Description: The total number of pods that reached the Succeeded phase across all the jobs in the cluster.
UI Location: Inventory > Objects > Jobs > Succeeded
Metric Path: Hardware Resources|Cluster|Jobs|Pods Succeeded

Memory

Memory Capacity

Metric Name: Total (MB)
Description: The total memory capacity of the cluster, in MB.
UI Location: Dashboard > Cluster Capacity > Memory
Metric Path: Hardware Resources|Cluster|Memory|Capacity|Total (MB)

Metric Name: Used (MB)
Description: The memory capacity already used by the cluster, in MB.
UI Location: Dashboard > Cluster Capacity > Memory
Metric Path: Hardware Resources|Cluster|Memory|Capacity|Used (MB)

Memory Quota

Metric Name: Limit Used (%)
Description: The percentage of Memory limit quota used.
UI Location: Dashboard > Quotas > Memory Limit
Metric Path: Hardware Resources|Cluster|Memory|Quota|Limit Used (%)

Metric Name: Limit Used (MB)
Description: The MB value for Memory limit quota used.
UI Location: Dashboard > Quotas > Memory Limit
Metric Path: Hardware Resources|Cluster|Memory|Quota|Limit Used (MB)

Metric Name: Request Used (%)
Description: The percentage of Memory request quota used.
UI Location: Dashboard > Quotas > Memory Request
Metric Path: Hardware Resources|Cluster|Memory|Quota|Request Used (%)

Metric Name: Request Used (MB)
Description: The MB value for Memory request quota used.
UI Location: Dashboard > Quotas > Memory Request
Metric Path: Hardware Resources|Cluster|Memory|Quota|Request Used (MB)

Memory Utilization

Metric Name: Limit (MB)
Description: The memory limit that the pods can use. Only pods in monitored namespaces are included in this metric.
UI Location: Dashboard > Utilization > Memory
Metric Path: Hardware Resources|Cluster|Memory|Utilization|Limit (MB)

Metric Name: Request (MB)
Description: The memory, in MB, that all the pods in monitored namespaces have requested.
UI Location: Dashboard > Utilization > Memory
Metric Path: Hardware Resources|Cluster|Memory|Utilization|Request (MB)

Metric Name: Used (MB)
Description: The memory that the pods in monitored namespaces are currently using.
UI Location: Dashboard > Utilization > Memory
Metric Path: Hardware Resources|Cluster|Memory|Utilization|Used (MB)

Nodes

Metric Name: Master Count
Description: The number of master nodes in the cluster.
UI Location: Inventory > Masters
Metric Path: Hardware Resources|Cluster|Nodes|Master Count

Metric Name: Worker Count
Description: The number of worker nodes in the cluster.
UI Location: Inventory > Workers
Metric Path: Hardware Resources|Cluster|Nodes|Worker Count

Metric Name: Memory Pressure Count
Description: The number of nodes that are under memory pressure in the cluster.
UI Location: Inventory > Memory Pressure
Metric Path: Hardware Resources|Cluster|Nodes|Memory Pressure Count

Metric Name: Disk Pressure Count
Description: The number of nodes that are under disk pressure in the cluster.
UI Location: Inventory > Disk Pressure
Metric Path: Hardware Resources|Cluster|Nodes|Disk Pressure Count

Pods

Pods Capacity

Metric Name: Total Count
Description: The total number of pods that a cluster can support.
UI Location: Pods > Total Count
Metric Path: Hardware Resources|Cluster|Pods|Capacity|Total Count

Metric Name: Used Count
Description: The number of pods already created in the cluster.
UI Location: Pods > Count
Metric Path: Hardware Resources|Cluster|Pods|Capacity|Used Count

PVC

PVC Quota

Metric Name: Used
Description: The PVC quota already in use in the cluster, as a count.
UI Location: Dashboard > Quotas > PVC
Metric Path: Hardware Resources|Cluster|PVC|Quota|Used

Metric Name: Used (%)
Description: The percentage of PVC quota already in use in the cluster.
UI Location: Dashboard > Quotas > PVC
Metric Path: Hardware Resources|Cluster|PVC|Quota|Used (%)

PVC Utilization

Metric Name: Capacity (MB)
Description: The total PVC capacity available to the pods in the monitored namespaces.
UI Location: Dashboard > Utilization > PVCs
Metric Path: Hardware Resources|Cluster|PVC|Utilization|Capacity (MB)

Metric Name: Request (MB)
Description: The PVC capacity requested by pods in the monitored namespaces.
UI Location: Dashboard > Utilization > PVCs
Metric Path: Hardware Resources|Cluster|PVC|Utilization|Request (MB)

ReplicaSets

Metric Name: Count
Description: The number of replica set resources in the cluster.
UI Location: Inventory > Objects > ReplicaSets > Count
Metric Path: Hardware Resources|Cluster|Count

Metric Name: Replicas
Description: The total number of replicas for all the replica sets in the cluster.
UI Location: Inventory > Objects > ReplicaSets > Count
Metric Path: Hardware Resources|Cluster|ReplicaSets|Replicas

Metric Name: Replicas Available
Description: The total number of available replicas for all the replica sets in the cluster.
UI Location: Inventory > Objects > ReplicaSets > Available
Metric Path: Hardware Resources|Cluster|ReplicaSets|Replicas Available

Metric Name: Replicas Unavailable
Description: The total number of unavailable replicas for all the replica sets in the cluster.
UI Location: Inventory > Objects > ReplicaSets > Unavailable
Metric Path: Hardware Resources|Cluster|ReplicaSets|Replicas Unavailable

Services

Metric Name: Count
Description: The total number of Kubernetes Services running in the cluster.
UI Location: Inventory > Services > Services
Metric Path: Hardware Resources|Cluster|Services|Count

Storage Quota

Metric Name: Used (MB)
Description: The storage quota used by the cluster in MB.
UI Location: Dashboard > Quotas > Storage
Metric Path: Hardware Resources|Cluster|Storage|Quota|Used (MB)

Metric Name: Used (%)
Description: The percentage of storage quota used by the cluster.
UI Location: Dashboard > Quotas > Storage
Metric Path: Hardware Resources|Cluster|Storage|Quota|Used (%)
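
Every metric path listed in this reference is a pipe-delimited string. When working with these paths in scripts, it can help to split them into segments. The following is a small sketch with hypothetical helper names (split_metric_path, describe). It assumes no segment itself contains a pipe character, which holds for the paths shown here.

```python
def split_metric_path(path: str) -> list[str]:
    """Split a pipe-delimited metric path into trimmed segments."""
    return [segment.strip() for segment in path.split("|")]


def describe(path: str) -> dict[str, object]:
    """Break a path into its root, intermediate groups, and metric name.

    Assumes the layout root|Cluster|[group...]|name used in this reference.
    """
    segments = split_metric_path(path)
    return {
        "root": segments[0],
        "groups": segments[2:-1],
        "name": segments[-1],
    }
```

For a path such as Hardware Resources|Cluster|Storage|Quota|Used (%), describe returns the root Hardware Resources, the groups Storage and Quota, and the name Used (%).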