Latency Budgets per Service: The Math That Holds

End-to-end latency goals decompose into per-service budgets. The math is simple; the discipline of tracking those budgets is rare.

Why budgets per service

End-to-end latency goals do not enforce themselves. Without per-service decomposition, no team owns its fraction of the goal, and the user-facing number slips.
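To make the idea of a per-service fraction concrete, here is a minimal sketch of one way to split an end-to-end target across services on a single serial path. The service names, weights, the 300 ms target, and the 10% headroom are all hypothetical. Treating budgets as additive is a simplification, since tail percentiles do not sum exactly, so read the output as a starting allocation rather than a guarantee.

```python
def allocate_budgets(end_to_end_ms, weights, headroom=0.1):
    """Split an end-to-end latency target across services on one serial path.

    weights:  service name -> relative share of the budget (hypothetical values).
    headroom: fraction of the target held back for network hops and jitter
              (an assumption of this sketch, not a rule from the article).
    """
    if not 0 <= headroom < 1:
        raise ValueError("headroom must be in [0, 1)")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    spendable_ms = end_to_end_ms * (1 - headroom)
    # Each service gets a share of the spendable budget proportional to its weight.
    return {name: spendable_ms * w / total_weight for name, w in weights.items()}


budgets = allocate_budgets(
    300, {"gateway": 1, "auth": 2, "search": 6, "render": 3}
)
for name, ms in budgets.items():
    print(f"{name}: {ms:.1f} ms")
```

With these inputs, 270 ms is spendable and the per-service budgets sum back to that figure, which gives every team a number it can own.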

Four-step decomposition

Tracking pattern

The budget only works if every service team can see its slice. Give each service a dashboard with its budget line drawn on it, and re-allocate budgets quarterly as the architecture changes.
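The check behind such a dashboard can be sketched in a few lines: compute each service's observed p99 and compare it with the budget line. This is an illustrative sketch, not a monitoring integration. The sample data, the budget values, and the function names `p99` and `budget_report` are assumptions, and the percentile uses the nearest-rank method, which may differ slightly from what your metrics backend reports.

```python
def p99(samples_ms):
    """Nearest-rank 99th percentile of a non-empty list of latencies (ms)."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    ordered = sorted(samples_ms)
    # ceil(0.99 * n) using integer arithmetic to avoid float rounding.
    rank = -(-99 * len(ordered) // 100)
    return ordered[rank - 1]


def budget_report(samples_by_service, budgets_ms):
    """Compare observed p99 with the budget for every budgeted service."""
    report = {}
    for name, budget in budgets_ms.items():
        observed = p99(samples_by_service[name])
        report[name] = {
            "p99_ms": observed,
            "budget_ms": budget,
            "over_budget": observed > budget,
        }
    return report


# Hypothetical latency samples (ms) and budgets for two services.
samples = {
    "auth": [20] * 98 + [40, 90],
    "search": [100] * 97 + [150, 160, 170],
}
report = budget_report(samples, {"auth": 45.0, "search": 135.0})
for name, row in report.items():
    status = "OVER" if row["over_budget"] else "ok"
    print(f"{name}: p99={row['p99_ms']} ms, budget={row['budget_ms']} ms [{status}]")
```

In this made-up data, `auth` sits under its line while `search` is over it, which is exactly the signal each team needs to see without asking anyone.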

Renegotiation cadence

When a service breaches its budget, the conversation must happen. Either invest in that service or renegotiate the budget; doing nothing makes the SLO a lie.
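One possible way to frame that conversation is to ask whether other services have enough unused budget to cover the breach while the end-to-end total stays fixed. The sketch below is a hypothetical policy, not a prescribed one: the function name `propose_renegotiation`, the 20% safety margin, and all numbers are assumptions. When it returns `None`, the spare budget elsewhere is too small, which points toward investing in the breaching service instead.

```python
def propose_renegotiation(budgets_ms, observed_p99_ms, breaching, safety_margin=0.2):
    """Try to cover a breach from other services' unused budget.

    Returns new budgets with the same total, or None if the spare budget
    elsewhere cannot cover the shortfall.
    """
    shortfall = observed_p99_ms[breaching] - budgets_ms[breaching]
    if shortfall <= 0:
        return dict(budgets_ms)  # no breach, nothing to renegotiate

    # A donor can give up whatever exceeds its observed p99 plus a cushion.
    donors = {}
    for name, budget in budgets_ms.items():
        if name == breaching:
            continue
        spare = budget - observed_p99_ms[name] * (1 + safety_margin)
        if spare > 0:
            donors[name] = spare

    total_spare = sum(donors.values())
    if total_spare < shortfall:
        return None

    new_budgets = dict(budgets_ms)
    for name, spare in donors.items():
        # Each donor contributes in proportion to its spare budget.
        new_budgets[name] -= shortfall * spare / total_spare
    new_budgets[breaching] += shortfall
    return new_budgets


current = {"auth": 45.0, "search": 135.0, "render": 67.5}
observed = {"auth": 25.0, "search": 150.0, "render": 50.0}
proposal = propose_renegotiation(current, observed, "search")
print(proposal)

# A larger breach that the other services cannot absorb.
too_big = propose_renegotiation(current, {**observed, "search": 170.0}, "search")
print(too_big)
```

Either outcome gives the meeting a concrete starting point: a proposed re-split to agree on, or evidence that the service needs engineering time.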

Antipatterns

What to do this week

Three moves. (1) Apply this pattern to your slowest production endpoint. (2) Measure p99 before and after. (3) Document the result and ship the runbook so the team can reproduce it.