SLO & Reliability Practical By Samson Tanimawo, PhD Published Sep 23, 2025 4 min read

SLOs and On-Call Pages

On-call should map to SLO breach.

Rule

Most teams page their on-call on every error, every threshold breach, every alert that fires anywhere in their monitoring system. The result is alert fatigue: the on-call wakes up at 3am for a 30-second blip that did not affect any customer, learns to ignore the next page, and misses the real incident three weeks later. SLO-aware on-call replaces this pattern with a simple rule: pages fire when the SLO is at risk, not when any individual error happens.

What this rule actually means:

The rule is simple to state and harder to implement. The implementation requires real SLO instrumentation; the cultural change is bigger than the technical one.

Reduces noise

The benefit of SLO-aware on-call is dramatic noise reduction. Teams that adopt this pattern routinely report that page volume drops by 60 to 80% within a quarter. The pages that remain are real; the pages that went away were noise.

The noise reduction is the visible benefit. The cultural shift it enables is the deeper benefit.

Signal

The flip side of noise reduction is signal preservation. The pages that fire are pages that should have fired. Each page is a real signal that warrants real response. The on-call's trust in the page system is the foundation of effective incident response.

SLO-aware on-call is the pattern that turns alerting from a noise generator into a signal system. Nova AI Ops integrates SLO-aware burn-rate alerts with on-call routing, surfaces the per-page response data so the team can see whether the noise reduction is working, and tracks the on-call burden over time so the operational sustainability is measurable.