To ensure that you do not miss or receive false alerts, you must configure alerts properly. Alert Sensitivity Tuning (AST) helps you configure alerts with sensitivity by providing historical data for the metric or the baseline being configured. AST also visualizes the impact of the alert configuration.

Create a Health Rule and Fine-tune Metric Evaluation

You can create a health rule to monitor the parameters of an application entity and fine-tune the sensitivity of a health rule using AST.

  • You must be a SaaS customer.
  • You must be monitoring a business transaction, a service endpoint, or a remote service (affected entity).
  1. In the entity (BT, SEP, or remote services) UI, select an entity from the list.
  2. Right-click the entity and click Create new Health Rule. The Create Health Rule UI displays.
  3. Configure the health rule overview details. For more information, see Configure Heath Rule Details.
  4. In the Affected Entities panel, select the entities to monitor.
  5. In the Critical Criteria panel, select +Add Condition
  6. Configure the health rule condition.
  7. If required, select the Warning Criteria panel and define warning conditions as described in step 6. 
  8. Click Save to save the health rule.

Configure the Health Rule Condition

  1. Enter a name for the condition in the first field of the condition row. This name is used in the generated notification text and in the AppDynamics Console to identify the violation.
  2. From the dropdown next to the Condition name field, select the single metric or metric expression to evaluate the condition.
    1. Single Metric:

      Qualifier Type

      Description

      Minimum

      The minimum value reported across the configured evaluation time length. Not all metrics have this type.

      Maximum

      The maximum value reported across the configured evaluation time length.  Not all metrics have this type.

      Value

      The arithmetic average of all metric values reported across the configured evaluation time length. This value is based on the type of metric.

      Sum

      The sum of all the metric values reported across the configured evaluation time length.

      Count

      The number of times the metric value has been measured across the configured evaluation time length.

      Group CountThe number of nodes contributing to a metric value, generally relevant for application or tier level metrics.

      Current

      The value for the current minute.

      1. From the Value dropdown, select a qualifier for the metric from these following options:

        • Click Select a Metric. Metric Selection dialog displays. The metric browser in the Metric Selection dialog displays metrics appropriate to the health rule.

        • Select a metric from the list for the business transaction. Click Select Metric
          A graphical view of the metric data for the last 8 hours displays.

          To view the metric data for one day or three days, select the time period using the dropdown.

      1. From the dropdown after the metric, select the type of comparison to evaluate the metric.

        • To limit the effect of the health rule to conditions during which the metric is within a defined range—standard deviations or percentages—from the baseline, select Within Baseline from the dropdown. 
          To limit the effect of the health rule to when the metric is not within that defined range, select Not Within Baseline, and then select:
           - baseline to use 
           - numeric qualifier of the unit of evaluation
           - unit of evaluation
          For example:

          Within Baseline of the Default Baseline by 3 Baseline Standard Deviations
        • To compare the metric with a static literal value, select < Specific value or > Specific Value from the menu, and then enter the specific value in the text field. For example:

          Value of Errors per Minute > 100
        • To compare the metric with a baseline, select < Baseline or > Baseline from the dropdown, and then select:
           - baseline to use
           - numeric qualifier of the unit of evaluation
           - unit of evaluation. 
          You can use the Baseline Standard Deviation or Baseline Percentage as the unit of evaluation. For example:

          Maximum of Average Response Time is > Baseline of the Daily Trend by 3 Baseline Standard Deviations

          See Dynamic Baselines for information about the baseline options.

          Baseline Percentages

          The baseline percentage is the percentage above or below the established baseline at which the condition will trigger. For example, if you have a baseline value of 850 and you have defined a baseline percentage of > 1%, the condition is true if the value is > [850+(850x0.01)] or 859.

          To prevent health rule violations from being triggered when the sample sets are too small, these rules are not evaluated if the load—the number of times the value has been measured—is less than 1000. For example, if a very brief time slice is specified, the rule may not violate even if the conditions are met, because the load is not large enough.
          Depending on the baseline configuration you define, a graphical view of the metric data for the given baseline configuration displays. The graphical view instantly updates when you update any configuration. You can also view granular details by zooming in on the graphical view. The metric data in the graph helps you calibrate the sensitivity of the metric evaluation.

    2. Metric Expression:
      1. Select the Metric Expression option from the dropdown and click Add Expression. The Metric Expression panel displays enabling you to construct a mathematical expression to use as a metric. For example, the following expression is created to measure the number of calls per CPU: Value of {calls} {cpu} > 1.

      2. In the Variable Declaration panel of the Mathematical Expression builder, click + Add variable.

      3. In the Variable Name field, enter a name for the variable.
      4. From the dropdown next to the Variable Name field, select the qualifier for the metric from these options:

        Qualifier Type

        Description

        Minimum

        The minimum value reported across the configured evaluation time length. This qualifier is not available for all the metrics.

        Maximum

        The maximum value reported across the configured evaluation time length. This qualifier is not available for all the metrics.

        Value

        The arithmetic average of all metric values reported across the configured evaluation time length. This value is based on the type of the metric.

        Sum

        The sum of all the metric values reported across the configured evaluation time length.

        Count

        The number of times the metric value has been measured across the configured evaluation time length.

        Group CountThe number of nodes contributing to a metric value, generally relevant for application or tier level metrics.

        Current

        The value for the current minute.

      5. Click Select a metric to open an embedded metric browser. To avoid erroneous evaluations, if any metric in the expression has a null value, the health rule is not evaluated.

        ExpressionNull ValueEvaluation
        a-b-c
        a
        entire expression is evaluated negative
        a/bb
        the number 'a' is divided by zero, evaluates to an error
        a*b
        a or b
        entire expression is evaluated as zero
      6. Repeat Steps i through v for every metric that you use in the expression. You can remove a variable by clicking Delete.
      7. In the Expression pane, build the expression by clicking Insert Variable to insert variables created in the Variable Declaration pane along with appropriate mathematical signs.
      8. When the expression is built, click Save.

      9. From the Critical Criteria tab, select < Specific value or > Specific Value from the pane to compare the value of the metric evaluation with a static literal value, and then enter the specific value in the text field. For example:

        Value of {calls} / {cpu} > 1
        YML

        A graphical view of the metric data for the last eight hours displays.

        You can view the metric data for one day or three days by selecting the time period using the dropdown.

  3. If you want the condition to evaluate to true when a configured metric does not return any data during the evaluation time frame, select the Evaluate to true on no data option. 
    This option does not affect the evaluation of the unknown in the case where there is no enough data for the rule to evaluate. For example, if the health rule is configured to evaluate the last 30 minutes of data and a new node is added, the condition evaluates to unknown for the first 30 minutes even if the Evaluate to true on no data option is selected.

  4. If you want to define a 'Persistence Threshold' for the condition to reduce false alerts: 

    1. Select Trigger only when a violation occurs __ times in the last __ min(s).

    2. Define the number of times metric performance data should exceed the defined threshold to constitute a violation.

    3. If required, adjust the evaluation time frame by setting an alternate evaluation time frame.

    • You can define a persistence threshold for a condition only if you have defined an evaluation time frame of 30 minutes or less.
    • If you define a persistence threshold for a condition, the metric data is plotted directly on the AST graph. If you do not define a persistence threshold, a 'moving average' for the selected metric is plotted. For more information, see Why Use Moving Average.
  5. To specify the evaluation scope in the Critical Criteria and Warning Criteria panels, select the average of all nodes.

    AST graphical view displays only when you set the health rule evaluation scope to the average for all nodes.