On-Call Runbook Quality

Quality scoring; refresh cadence.

Overview

On-call runbook quality measures and maintains runbook freshness. Runbook count is the easy metric; quality runbooks are what actually reduce MTTR. The library is only as useful as its worst-rated entry.

Quality scoring plus refresh cadence. Per-runbook quality score; the score determines whether on-call trusts the runbook.
Per-runbook freshness. Last-updated date per runbook; over 6 months is dated; over 12 months is suspect.
Per-incident runbook usage. Per-incident the runbook used; the data shows which runbooks are actually consulted.
Per-quarter refresh plus owner. Quarterly refresh catches drift; per-runbook owner supports accountability.

The approach

The practical approach: per-runbook owner named, quarterly refresh scheduled, per-incident usage tracked, documented policy. The team’s discipline produces fresh runbooks instead of a graveyard of stale documents.

Per-runbook owner. Each runbook has a named owner; supports accountability; "everyone’s runbook" rots into "no one’s."
Per-quarter refresh. Quarterly runbook refresh; the cadence catches drift before it becomes an incident-time discovery.
Per-incident runbook usage. Track which runbooks were consulted per incident; the data prioritises the next refresh.
Per-runbook freshness. Last-updated date visible; the on-call sees the freshness before trusting the runbook.
Document the policy. Per-team runbook policy committed to the handbook; supports operational reviews.

Why this compounds

Runbook quality discipline compounds across rotations. Each fresh runbook reduces MTTR; the team’s on-call maturity grows; new joiners onboard faster on a trustworthy library.

Better incident response. Fresh runbooks reduce MTTR; the on-call follows the runbook instead of investigating from scratch.
Better onboarding. Fresh runbooks support velocity; new on-callers ramp on a current library, not archaeology.
Better culture. Runbook quality signals that on-call matters; the team invests because the team uses the output.
Institutional knowledge. Each runbook teaches operational patterns; the team’s collective on-call expertise grows.

Runbook quality discipline is an operational discipline that pays off across years. Nova AI Ops integrates with runbook telemetry, surfaces patterns, and supports the team’s on-call discipline.