SLO Dashboard Design: Five Must-Haves
An SLO dashboard is the most-glanced surface in engineering. Design it to answer questions in seconds.
What the dashboard answers
Three questions in seconds: are we healthy now? Trending right? Burn-rate alert state?
Anything beyond requires drill-down; the dashboard catches the question.
Five must-have panels
- 1. SLO compliance percentage with target line.
- 2. Error budget remaining with sparkline.
- 3. Burn rate trend over multiple windows.
- 4. Top contributors to budget consumption.
- 5. Recent incidents tagged to budget impact.
Visual idioms
Big-number-with-trend for top metrics. Red/yellow/green for status. No charts that take more than a second to read.
Polished design beats baroque every time.
Incident-resilient layout
Dashboard URL bookmarked by on-call. Loads in <2s. Works in dark mode. Has clear sectioning.
Tested during a real incident; tuned after.
Antipatterns
- 20 panels. Overwhelming.
- Pretty visualisations that take effort to read. Slows incident response.
- Dashboard never reviewed for use. Drifts from utility.
What to do this week
Three moves. (1) Apply the pattern to your most-impactful service. (2) Measure adherence for 30 days. (3) Rewrite the policy or the SLO if the gap is durable.