Menu
Grafana Cloud

Troubleshoot an SLO breach using Grafana Cloud Asserts

This topic show you how to interpret an error budget burn down chart and use Asserts to troubleshoot an SLO breach.

Before you begin

Before you begin, ensure that you have defined an Asserts SLO.

Steps

To use Asserts to troubleshoot an SLO breach, perform the following steps:

  1. Sign in to Grafana Cloud and click Asserts > SLO.

  2. Expand Objective for the SLO you want to investigate.

  3. Use the following table to interpret the Error Budget Burndown panel.

    NumberElementDescription
    1Target vs ActualShows the target SLO compared to the actual SLO.
    2Incidents in WindowCounts the number of SLOs incidents that occurred during the compliance window defined with the SLO was created.
    3Budget UsedThe amount of budget used expressed as a number. If the number is greater than 1, then more than 100% of error budget has been used.
    4Recent Budget UsageThe error burn rate calculated over a recent, specific time window. For example, calculating the error burn rate over the last hour gives you a sense of how quickly you’re burning through your error budget.
    5Current Incident StatusShows an icon that indicates whether error budget is currently being consumed.
    6Events QueryShows the Bad and Total Events Query used to calculate the SLO.
    7Error Budget Burndown chartThe yellow dashed line indicates the ideal error budget burn down rate. The green line indicates the actual burn down rate. In this example, the error budget remains untouched until the end of the compliance window, when there is consistent and dramatic use of the error budget.

    Error Budget Burndown panel showing an overview of an SLO

  4. Scroll down the page and review the SLI Zoomed In panel.

    In this example, you can see a large spike in error budget usage.

    SLI chart showing spike in error budget usage

  5. In the Error Budget Burndown panel, click and drag your cursor to select the time range you want to investigate and click Open in RCA workbench.

    The Open in RCA workbench button appears after you have added a search expression in the RCA workbench Context section while creating the SLO.

    Error Budget Burndown panel showing selected time range

  6. Use RCA workbench to explore entities and assertions.

    For more information about RCA workbench, refer to Perform root cause analysis in RCA workbench.

    RCA workbench showing services related to an SLO breach