SLO & Reliability Practical By Samson Tanimawo, PhD Published Aug 17, 2025 4 min read

SLOs on ML Services

ML adds quality dimension to SLOs.

Dimensions

Standard SLO frameworks were built for stateless request-response services where success is mostly about availability and latency. Machine learning services break that frame. A model that responds in 100ms with the wrong answer is not "available" in any meaningful sense; the inference completed but the result is incorrect. The fix is multi-dimensional SLOs that include quality alongside the conventional dimensions.

What ML SLO dimensions actually cover:

The dimensional model for ML SLOs is what turns "the model is up" into a meaningful claim about service quality. Without quality in the SLO, the team's reliability story is dangerously incomplete.

Track quality

The hard part of ML SLOs is operationalizing the quality dimension. Latency and errors come from the metric pipeline; quality requires labeled data, eval pipelines, and ongoing measurement against a moving production distribution. The discipline is more involved but the techniques are well-established.

Quality tracking turns ML SLOs from theory into practice. Without it, the quality dimension is asserted but not measured, and the team finds out about regressions from customer complaints rather than from the dashboard.

Compound

Multi-dimensional ML SLOs require more instrumentation than single-dimensional service SLOs. The investment is justified because the workload's failure modes are themselves multi-dimensional. The compound view is what makes the SLO useful.

Multi-dimensional ML SLOs are the discipline that makes ML systems operationally honest. Nova AI Ops integrates with eval pipelines, tracks quality alongside latency and errors per ML service, and surfaces the per-segment quality breakdown so the team can see where the model is actually working and where it is not.