Related pages: |
Health rules let you specify the parameters that represent what you consider normal or expected operations for your environment. The parameters rely on metric values, for example, the average response time for a business transaction or CPU utilization for a node.
When the performance of an entity affected by the health rule violates the rule's conditions, a health rule violation occurs. The health statuses are represented as critical, warning, normal, and unknown.
When the health status of an entity changes, a health rule violation event occurs. Examples of a health rule violation include:
The health statuses of entities and health rule violations are surfaced in the controller user interface. A health rule violation event can also be used to trigger a policy, which can initiate automatic actions, such as, sending alerting emails or running remedial scripts.
You create health rules using the health rule wizard, described in Configure Health Rules. The wizard groups commonly-used system entities and related metrics to simplify setting up health rules. You can also use the default health rules as-is, or modify them.
provides a default set of health rules for some products, such as applications and servers. These default health rules vary depending on the entity. To see the default rules, before any health rules have been added to your
installation:
If any of these predefined health rules are violated, the affected entities are marked in the UI as yellow-orange if it is a Warning violation and red if it is a Critical violation.
In many cases, the default health rules may be the only health rules that you need. You can edit and customize the health rules to suit your application. You can also disable the default health rules.
The health rule scope determines the set of default health rule types. You can choose the scope to get a set of default health rule types for applications, servers, or databases. For example, when you define a mobile application as the scope, the default health rules such as crash rates and HTTP/network error rates are displayed. Similarly, if you define the health rule scope for an application, the health rules would be for business transactions, CPU/memory utilization, and so on.
From Alert & Respond > Health Rules, you can select one of the following health rule scopes from the drop-down list:
You can also create new health rules to add to the default set for each scope. You may want to add the health rule app starts to your mobile application. This health rule is not part of the default set of health rules in the mobile app scope, so you would just need to add a new health rule.
The health rule wizard groups health rules into types that are categorized by the entity that the health rule covers. This allows the wizard to display appropriate configuration items during the health rule creation.
The following table lists various health rule types.
Health Rule Type | Description |
---|---|
Transaction Performance |
|
Node Health |
|
User Experience-Browser Apps |
|
User Experience-Mobile Apps |
|
Event Storage- Overage Monitoring |
The following metrics are available for the events:
|
Servers | Groups metrics related to hardware resources. |
Databases & Remote Services | Groups metrics related to response time, load, or errors with databases and other backends. |
Advanced Network | Groups metrics related to Network Visibility, such as PIE (performance impact events), zero window, data retransmission, and errors. |
Error Rates | Groups metrics related to exceptions, return codes, and other errors with applications or tiers. |
Information Points | Groups metrics like response time, load, or errors with information points. |
Service Endpoints | Java and .NET only; groups metrics like average response time, calls per minute, and errors per minute with service endpoints. |
Custom | Presents all the metrics collected by the agent that could affect a single business transaction, a single node or overall application performance. Use this type to create rules that evaluate custom metrics. |
When you select one of these health rule types, the wizard offers you the metrics commonly associated with that type in an embedded browser.
recommends the following process to set up health rules for your application:
If the default health rules cover all the key metrics you need, determine if the pre-configured conditions are applicable to your environment. If required, modify the conditions.
Define a metric expression to evaluate complex criteria for a condition. Define a boolean expression to evaluate multiple health rule conditions. |
After you set up health rules you must configure policies and actions to be executed when health rules are violated. See Policies and Actions.
Your application status is based on health rules for the current time range. If you disable old health rule policies or enable new ones, you might see errors in red in your application status, even if there are no current critical events based on the new policies. To verify that your new or disabled health rule policies have taken effect, change the time range in your dashboard to a smaller, more recent time frame.