Related pages:

Health rules let you specify the parameters that represent what you consider normal or expected operations for your environment. The parameters rely on metric values, for example, the average response time for a business transaction or CPU utilization for a node.

When the performance of an entity affected by the health rule violates the rule's conditions, a health rule violation occurs. The health statuses are represented as critical, warning, normal, and unknown. 

When the health status of an entity changes, a health rule violation event occurs. Examples of a health rule violation include:

The health statuses of entities and health rule violations are surfaced in the controller user interface. A health rule violation event can also be used to trigger a policy, which can initiate automatic actions, such as, sending alerting emails or running remedial scripts.

You create health rules using the health rule wizard, described in Configure Health Rules. The wizard groups commonly-used system entities and related metrics to simplify setting up health rules. You can also use the default health rules as-is, or modify them.

Default Health Rules

provides a default set of health rules for some products, such as applications and servers. These default health rules vary depending on the entity. To see the default rules, before any health rules have been added to your  installation:

  1. From the Alert & Respond tab, click Health Rules.
  2. Select the entity.
    The default health rules for the selected entity are displayed.

If any of these predefined health rules are violated, the affected entities are marked in the UI as yellow-orange if it is a Warning violation and red if it is a Critical violation.

In many cases, the default health rules may be the only health rules that you need. You can edit and customize the health rules to suit your application. You can also disable the default health rules.

Health Rule Scopes

The health rule scope determines the set of default health rule types. You can choose the scope to get a set of default health rule types for applications, servers, or databases. For example, when you define a mobile application as the scope, the default health rules such as crash rates and HTTP/network error rates are displayed. Similarly, if you define the health rule scope for an application, the health rules would be for business transactions, CPU/memory utilization, and so on.

From Alert & Respond > Health Rules, you can select one of the following health rule scopes from the drop-down list:

You can also create new health rules to add to the default set for each scope. You may want to add the health rule app starts to your mobile application. This health rule is not part of the default set of health rules in the mobile app scope, so you would just need to add a new health rule.

Heath Rule Types

The health rule wizard groups health rules into types that are categorized by the entity that the health rule covers. This allows the wizard to display appropriate configuration items during the health rule creation.

The following table lists various health rule types.

Health Rule TypeDescription
Transaction Performance
  • Overall Application Performance: Groups metrics related to load, response time, slow calls, stalls, with applications.
  • Business Transaction Performance: Groups metrics related to load, response time, slow calls, stalls, so on, with business transactions.
Node Health
  • Node Health-Hardware, JVM, CLR: Groups metrics like CPU and heap usage, disk I/O, so on, with nodes.
  • Node Health-Transaction Performance: Groups metric related to load, response time, slow calls, stalls, so on, with nodes.
  • Node Health-JMX: Java only, groups metrics related to connection pools, thread pools, so on, with specific JMX instances and objects in specific nodes and tiers.

User Experience-Browser Apps

  • Pages: Groups metrics like DOM building time, JavaScript errors, so on, with the performance of application pages for the end-user.
  • IFrames: Groups metrics like first-byte time, requests per minute, so on, with the performance of iFrames for the end-user.
  • AJAX Requests: Groups metrics like Ajax callback execution time, errors per minute, so on, with the performance of Ajax requests for the end-user.
  • Virtual Pages: Groups metrics like End User Response Time, Digest Cycles, HTML Download Time, DOM Building Time, etc. for virtual pages created with Angular. See AngularJS Support for information on what these metrics mean in the context of virtual pages.

User Experience-Mobile Apps

  • Mobile Apps: Groups metrics related to mobile app crashes, starts, and server calls as well as network requests and errors.
  • Network Requests: Groups metrics like HTTP and network errors, request time, and requests per minute with network requests.

Event Storage- Overage Monitoring

  • Browser Events: Groups metrics that measure the storage consumption limit for the browser events.
  • Mobile Events: Groups metrics that measure the storage consumption limit for the mobile events.
  • Analytics Events: Groups metrics that measure the storage consumption limit for the analytics events.

The following metrics are available for the events:

  • Event Storage Utilized (%)
  • Event Storage Available (%)
  • Event Storage Utilized (GB)
  • Event Storage Available (GB)
ServersGroups metrics related to hardware resources.
Databases & Remote ServicesGroups metrics related to response time, load, or errors with databases and other backends.
Advanced NetworkGroups metrics related to Network Visibility, such as PIE (performance impact events), zero window, data retransmission, and errors.
Error RatesGroups metrics related to exceptions, return codes, and other errors with applications or tiers.
Information PointsGroups metrics like response time, load, or errors with information points.
Service EndpointsJava and .NET only; groups metrics like average response time, calls per minute, and errors per minute with service endpoints.
Custom

Presents all the metrics collected by the agent that could affect a single business transaction, a single node or overall application performance. Use this type to create rules that evaluate custom metrics.

When you select one of these health rule types, the wizard offers you the metrics commonly associated with that type in an embedded browser.

How to Set Up Health Rules?

 recommends the following process to set up health rules for your application:

  1. Identify the key metrics (performance indicators) on the key entities that you need to monitor.
  2. Click Alert & Respond > Health Rules to examine any default health rules.
  3. If default health rules do not cover all your requirements or if you need finely-applied health rules to cover specific use cases, create new health rules.
    1. Identify the type of health rule that you want to create. See Health Rule Types.
    2. Decide which entities are affected by the new rule. See Entities Affected by a Health Rule.
    3. Define the conditions to monitor. See Create and Configure Conditions.
  4. If you want the health rules to be evaluated according to a pre-defined time schedule, create a health rule schedule. In some situations, a health rule is more useful if it is evaluated at a particular time. See Health Rule Schedules.

After you set up health rules you must configure policies and actions to be executed when health rules are violated. See Policies and Actions.

Additional Considerations