When you open the dashboard, you are presented with a summary of your application's performance based on the current state of the SLOs you have configured. This summary shows you everything that requires attention, either immediately or in the near future. This section contains three widgets:
- Currently Impacted SLOs
- Low Error Budget SLOs
- Active Alerts
The Currently Impacted SLOs widget shows you the number of SLOs that are being impacted at the present moment, meaning SLOs that are actively consuming error budget. Impact is determined based on each SLO's current error budget burn rate.
The widget displays impact in two distinct categories: high impact and low impact. High-impact SLOs are characterised by fast-burning error budgets, while low-impact SLOs are characterised by slow-burning error budgets. This gives you an immediate sense of any issues currently affecting your application and of how severe they are, so that you can address them before they become major problems.
Currently Impacted SLOs widget
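As a rough illustration of the grouping described above, the following sketch counts impacted SLOs by burn rate. The threshold separating fast from slow burn is not stated here, so `FAST_BURN_THRESHOLD` is a hypothetical value, and the function names and input shape are assumptions made for the example rather than part of the product.

```python
from collections import Counter

# Hypothetical cut-off between slow and fast burn; the real value is not
# documented in this section.
FAST_BURN_THRESHOLD = 1.0


def classify_impact(burn_rate: float, fast_threshold: float = FAST_BURN_THRESHOLD) -> str:
    """Return "high", "low", or "none" for a single SLO's burn rate."""
    if burn_rate <= 0:
        # Not consuming error budget, so the SLO is not currently impacted.
        return "none"
    return "high" if burn_rate >= fast_threshold else "low"


def count_impacted(burn_rates: dict[str, float]) -> Counter:
    """Count impacted SLOs per category, skipping SLOs that are not burning budget."""
    impacts = (classify_impact(rate) for rate in burn_rates.values())
    return Counter(impact for impact in impacts if impact != "none")


if __name__ == "__main__":
    # Illustrative SLO names and burn rates only.
    print(count_impacted({"checkout-latency": 3.0, "login-errors": 0.4, "search": 0.0}))
```

Only SLOs with a positive burn rate are counted, which mirrors the widget's focus on SLOs that are actively consuming error budget.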
The Low Error Budget SLOs widget shows you the number of SLOs that are in a Non-Compliant or In Danger state. These states are determined by the remaining error budget of each SLO. An SLO becomes Non-Compliant when its error budget has been fully depleted, and is considered In Danger when less than 25% of its error budget remains.
This information tells you whether any SLOs have fully consumed their error budget or are at risk of doing so. You can then identify areas of your application that need attention now or in the near future, and plan reliability work ahead of time.
Low Error Budget SLOs widget
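The state rules above can be sketched as a small function. This is a minimal illustration, assuming the remaining error budget is available as a fraction between 0 and 1 (negative if the budget is overspent); the function name and input format are hypothetical.

```python
# From the text: an SLO is In Danger when under 25% of its error budget remains.
IN_DANGER_THRESHOLD = 0.25


def slo_state(remaining_budget: float) -> str:
    """Map the remaining error budget (fraction, 1.0 = untouched) to a state."""
    if remaining_budget <= 0:
        # Error budget fully depleted (or overspent).
        return "Non-Compliant"
    if remaining_budget < IN_DANGER_THRESHOLD:
        return "In Danger"
    return "Compliant"


if __name__ == "__main__":
    for remaining in (0.80, 0.10, 0.0):
        print(f"{remaining:.0%} remaining: {slo_state(remaining)}")
```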
The Active Alerts widget displays the total number of currently active alerts triggered by any of the alert policies you have assigned to your SLOs. This tells you whether any alert conditions are being breached at the present moment and makes you aware of any problems that are currently impacting your application. With this information, you can follow up on the incident management process within your preferred incident management tool and make sure the alerts are being handled properly by the right people.
This widget is displayed as a doughnut chart. The total number of active alerts is displayed in the center of the chart. This total is then split into three categories, represented by the different sections of the doughnut. The categories are based on the state of the SLOs that are triggering the alerts: Compliant, In Danger or Non-Compliant. Alerts triggered by Non-Compliant SLOs have a greater impact on your reliability goals than alerts triggered by Compliant SLOs, because Compliant SLOs still have error budget left.
Active Alerts widget
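The breakdown shown in the doughnut chart amounts to grouping active alerts by the state of the SLO that triggered them. The sketch below assumes each active alert is represented by the name of its SLO and that SLO states are available in a lookup table; both structures and the function name are hypothetical.

```python
from collections import Counter

# The three SLO states used as chart segments.
STATES = ("Compliant", "In Danger", "Non-Compliant")


def alerts_by_slo_state(active_alerts, slo_states):
    """Return (total, per-state counts) for the active alerts.

    active_alerts: iterable of SLO names, one entry per active alert.
    slo_states: mapping of SLO name to one of STATES.
    """
    counts = Counter(slo_states[slo_name] for slo_name in active_alerts)
    # Include every state so that empty segments are reported as zero.
    segments = {state: counts.get(state, 0) for state in STATES}
    return sum(segments.values()), segments


if __name__ == "__main__":
    # Illustrative data only.
    states = {"checkout-latency": "Non-Compliant", "login-errors": "Compliant"}
    total, segments = alerts_by_slo_state(
        ["checkout-latency", "checkout-latency", "login-errors"], states
    )
    print(total, segments)
```

The total corresponds to the number in the center of the chart, and the per-state counts correspond to its three segments.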