Your First Grafana Dashboard
Three panels.
Overview
The first dashboard a team builds usually ends up as a wall of every metric the service emits. The version that earns its keep does the opposite: three panels, scoped to one service, linked from the on-call runbook.
- Three RED panels. Request rate, error rate, p99 latency. Three is the floor for usefulness and the ceiling for first-page comprehension.
- Service-scoped. One dashboard per service, owned by the team that operates it. Cross-service dashboards belong elsewhere.
- Environment variable. A
$envdropdown switches between staging and prod so the same dashboard serves both. - Deploy annotations and runbook link. CI publishes a deploy marker on the timeline and the on-call runbook links straight to the dashboard. Half of root-causes start with “was there a deploy?”
The approach
The approach borrows from the RED method, uses variables to scale across environments, and ships the dashboard as code so it survives the engineer who built it.
- RED method. Rate, Errors, Duration. The three signals that map directly to user experience.
- Variables for context.
$envand$namespacelet one dashboard serve every environment without copy-paste. - Deploy annotations. The CI pipeline pushes annotations on deploy. Time-correlation between deploy and impact is the cheapest investigation step.
- Documented purpose. A short description at the top: what this dashboard answers, what it does not. Future operators inherit context.
Why this compounds
The first dashboard is investment; the next ones reuse the template. Within a year the team has a consistent monitoring surface across every service it owns.
- Faster investigation. A dashboard that surfaces the right three numbers cuts root-cause time meaningfully on the incidents the team sees most often.
- Service understanding. Watching the dashboard during normal operation teaches the team how the service actually behaves, not how it was supposed to behave.
- Cross-team consistency. A shared template means every team’s dashboard reads the same way. New on-calls are productive on day one.
- Year-one investment, year-two habit. The first dashboard takes effort. By the third or fourth, the team has a template that scales without thought.