By Samson Tanimawo, PhD. Published Oct 10, 2025.

Multi-Region SLO Rollup

Aggregate regional SLOs into a global SLO.

Math

The naive way to compute a global SLO from regional ones is to take the average. It is also wrong, in a way that almost guarantees you will report a number that does not match user experience. The right way is a traffic-weighted aggregate, and the difference between the two can be the difference between a healthy report and a hidden customer-impacting outage.

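Why straight averaging breaks: it gives every region the same weight no matter how many requests that region serves. A small, healthy region can mask a failure in a large one, and a small, failing region can drag down a number that most users never felt.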

The traffic-weighted aggregate is the only number that matches what customers actually experienced. It is also the only number that survives audit when a stakeholder asks "show me the math behind that number."
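To make the gap concrete, here is a minimal sketch that computes both numbers from per-region request counts. The `RegionWindow` type, the region names, and the counts are illustrative assumptions, not data from a real system. The weighted figure is simply total good requests divided by total requests across all regions, which is the same as weighting each region's ratio by its share of traffic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RegionWindow:
    """Request counts for one region over one SLO window (hypothetical shape)."""
    region: str
    good: int   # requests that met the SLI
    total: int  # all valid requests in the window


def naive_average(windows):
    """Unweighted mean of per-region success ratios (the misleading number)."""
    ratios = [w.good / w.total for w in windows if w.total > 0]
    if not ratios:
        raise ValueError("no region reported any traffic")
    return sum(ratios) / len(ratios)


def traffic_weighted(windows):
    """Total good requests over total requests, summed across regions."""
    total = sum(w.total for w in windows)
    if total == 0:
        raise ValueError("no region reported any traffic")
    return sum(w.good for w in windows) / total


# Illustrative counts only: the largest region is the one that is degraded.
windows = [
    RegionWindow("us-east", good=9_000_000, total=10_000_000),  # 90.0%
    RegionWindow("eu-west", good=999_000, total=1_000_000),     # 99.9%
    RegionWindow("ap-south", good=99_900, total=100_000),       # 99.9%
]

naive = naive_average(windows)
weighted = traffic_weighted(windows)
print(f"naive average:    {naive:.2%}")
print(f"traffic-weighted: {weighted:.2%}")
```

With these made-up counts the naive average reports roughly 96.6% while the traffic-weighted figure is roughly 91.0%, because the degraded region carries about 90% of the requests.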

Display

Once the math is right, the dashboard has to reflect both the global picture and the regional decomposition. A dashboard that shows only the global number hides regional issues. A dashboard that shows only regional numbers makes it impossible to answer "are we hitting our overall commitment."

The display layer is what turns the rollup math from a single number into a tool the team can actually act on.
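One way to sketch that layout is a single table with the global row first and one row per region beneath it, each showing its SLI, its share of traffic, and whether it meets the target. The function names, the dictionary shape, and the 99.9% target below are assumptions for illustration, and the sketch assumes every listed region served at least one request.

```python
def rollup_rows(counts, target):
    """Build display rows from {region: (good, total)}; the global row comes first.

    Assumes every region has total > 0 (a simplification for this sketch).
    """
    grand_good = sum(good for good, _ in counts.values())
    grand_total = sum(total for _, total in counts.values())
    rows = [{"scope": "global", "sli": grand_good / grand_total, "share": 1.0}]
    for region, (good, total) in sorted(counts.items()):
        rows.append(
            {"scope": region, "sli": good / total, "share": total / grand_total}
        )
    for row in rows:
        row["meets_target"] = row["sli"] >= target
    return rows


def render(rows):
    """Format the rows as a fixed-width text table."""
    lines = [f"{'scope':<10} {'SLI':>9} {'traffic':>8}  status"]
    for row in rows:
        status = "ok" if row["meets_target"] else "BREACH"
        lines.append(
            f"{row['scope']:<10} {row['sli']:>9.3%} {row['share']:>8.1%}  {status}"
        )
    return "\n".join(lines)


# Illustrative counts only: a small region is failing while the global number holds.
counts = {
    "us-east": (9_995_000, 10_000_000),
    "eu-west": (999_500, 1_000_000),
    "ap-south": (95_000, 100_000),
}
rows = rollup_rows(counts, target=0.999)
print(render(rows))
```

In this example the global row still meets the target while `ap-south` does not, which is the situation a global-only dashboard would hide.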

Alert

Alerting only on the global rollup is exactly how you end up sleeping through a regional outage. The alert thresholds have to be region-aware so that a single region's failure pages someone, even when the traffic-weighted global metric stays green.
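A minimal sketch of region-aware evaluation follows. It checks the global rollup and each region against its own threshold and returns every scope that is in breach. The function name, the thresholds, and the counts are hypothetical. It is also a plain threshold check on one window, which is a deliberate simplification rather than a full alerting policy.

```python
def evaluate_alerts(counts, global_target, region_target):
    """Return (scope, sli) pairs for every scope below its target.

    counts maps region -> (good, total) for the evaluation window.
    """
    alerts = []
    grand_good = sum(good for good, _ in counts.values())
    grand_total = sum(total for _, total in counts.values())
    if grand_total > 0 and grand_good / grand_total < global_target:
        alerts.append(("global", grand_good / grand_total))
    for region, (good, total) in sorted(counts.items()):
        if total == 0:
            continue  # no traffic in this window, nothing to evaluate in this sketch
        sli = good / total
        if sli < region_target:
            alerts.append((region, sli))
    return alerts


# Illustrative counts only: the global rollup is green, one small region is not.
counts = {
    "us-east": (9_995_000, 10_000_000),
    "eu-west": (999_500, 1_000_000),
    "ap-south": (95_000, 100_000),
}
alerts = evaluate_alerts(counts, global_target=0.999, region_target=0.999)
for scope, sli in alerts:
    print(f"page on-call for {scope}: SLI {sli:.2%}")
```

Here only `ap-south` is returned, so a check against the global number alone would have stayed silent. Keying each alert by region is also what lets it be routed to that region's on-call.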

Multi-region SLO rollups done right give you both the executive headline and the operator's working surface. Nova AI Ops computes traffic-weighted global SLOs across regions, surfaces per-region drilldown on the same dashboard, and routes alerts to the right on-call by region so a regional issue does not need a global page to wake the right person.