SLO Org Alignment
SLOs across the org consistent.
The misalignment problem
Different teams use different SLO definitions. Team A’s 99.9% measures one thing; Team B’s 99.9% measures another; comparison is impossible and rollups are meaningless. Cross-service SLOs become arithmetic chaos, and stakeholders lose trust in the numbers.
- Inconsistent definitions. Team A’s 99.9% measures one thing, Team B’s 99.9% measures another; comparison is impossible.
- Rollups become meaningless. Aggregate SLOs across teams cannot be computed when the underlying definitions differ.
- Dependency math breaks. Cross-service SLOs in service-A-depends-on-service-B chains cannot be reconciled.
- Stakeholder trust erodes. When two teams report different reliability for the same flow, stakeholders ignore the numbers.
The standard definition
Standardisation is the prerequisite for organisational SLOs. The SLI primitives, time windows, and units must mean the same thing across every team; without that contract, every later step (rollup, dependency math, stakeholder reporting) breaks down.
- SLI primitives. Success ratio, latency percentile, error rate; each defined unambiguously.
- Success definition. A request that returned 2xx within latency budget; an error is anything else.
- Standard time windows. 30-day rolling for monthly SLOs; calendar-aligned for quarterly reporting.
- Standard units. Latency in milliseconds, percentiles as decimals; no mixing seconds and milliseconds across teams.
Cross-team review forum
Standardisation needs a forum. Monthly review with reps from each team surfaces drift; quarterly recalibration adjusts targets as the product and industry shift; documented decisions give new engineers the rationale and future debates the prior context.
- Monthly review. Reps from each team; compare apples to apples; surface drift in definitions; align on changes.
- Quarterly recalibration. Industry benchmarks shift, product needs evolve, SLO targets follow.
- Documented decisions. SLO definition changes have a record; new engineers understand the rationale.
- Per-decision rationale. Future debates reference the prior decisions; the forum compounds organisational memory.
Aggregating to org-wide SLOs
Per-team SLOs feed product-level SLOs. The checkout flow involves five services; the flow SLO is a function of the service SLOs. Serial-chain SLO is the product of component SLOs; parallel-chain SLO depends on whether one failure breaks the flow.
- Per-flow rollup. Per-team SLOs feed product-level SLOs; the checkout flow SLO is a function of its services.
- Serial-chain math. SLO of a serial chain is the product of component SLOs; the dependency math is multiplicative.
- Parallel-chain math. Depends on whether one failure breaks the flow; the math differs from serial.
- Tooling support. Nobl9 and similar SLO platforms model dependencies; homegrown setups need careful PromQL or custom code.
When alignment breaks down
Persistent disagreement on definitions usually reflects different stakeholder priorities. Escalation to engineering leadership resolves with policy; tooling fragmentation amplifies misalignment, so picking one SLO platform across the org is high-leverage even when migration is painful.
- Persistent disagreement. Different stakeholder priorities; escalate to engineering leadership; resolve with policy.
- Tooling fragmentation. Multiple SLO platforms amplify misalignment; one platform across the org is high-leverage.
- The forum is the system. Without it, drift accumulates; with it, alignment compounds across the org.
- Per-org SLO policy. Documented standard committed to engineering handbook; supports onboarding and dispute resolution.