Intermediate By Samson Tanimawo, PhD Published Sep 29, 2026 5 min read

Error Budget Formulas Cheat Sheet

Every error-budget calculation with the PromQL beside it. Designed to live next to your alert rules file, not in a doc no one reads.

The core formulas

Three numbers, that's it. Once these are computed, everything else is policy.

Budget remaining

The number on the dashboard. Negative means you've already missed; pick the response policy below.

PromQL for a 99.9% availability SLO over 30 days:

Burn rate

How fast the budget is being spent right now. The reference table is fixed by SLO window length and is worth memorising.

PromQL: (sum(rate(http_requests_total{status=~"5.."}[1h])) / sum(rate(http_requests_total[1h]))) / (1 - 0.999)

Multi-window alerts

Single-window burn alerts flap. Pair a long window (catches sustained burns) with a short window (catches fresh burns). Alert fires only when both fire, kills 90% of false pages.

PromQL pattern (fast-burn):

Latency budgets

Treat slow as bad. Define a "good" event as latency ≤ threshold, and the same formulas apply.

Policy thresholds

The numbers turn into action. Make these explicit so on-call doesn't have to negotiate them at 3am.