Service Health Matrix is the reliability roll-up. One row per service, one column per SLO, color-coded compliance. Click any cell to drill into the underlying SLI, the burn-rate window, and the active alerts. Designed for the morning standup and the 3am page alike.
Each row is a service, each column is one of the SLOs you have defined for it (latency, availability, error budget, saturation, freshness). The cell color is computed from the live SLI compared to the SLO target over the rolling 30-day window. Green is on track, yellow is burning fast, red is over budget.
Each cell tracks burn rate across short and long windows (1h, 6h, 24h, 30d) so a brief blip does not flip the cell red and a slow drift does not stay green forever. The cell shows the burn-rate ratio (how many times faster than budget the service is burning) and the projected exhaustion date.
Filter by team, by tier (tier-0 / tier-1 / tier-2), by environment, by region. Roll up by team to get a one-row view per team. Drill down into any cell to see the underlying SLI definition, the queries that compute it, and the alerts that fire on burn-rate breaches.
When a cell crosses the red threshold (over budget, fast-burn breach), Nova fires an alert into the On-Call rotation for the owning team. The alert payload includes the service, the SLO, the burn rate, the projected exhaustion, and a deep link back to the cell. No "where do I go?" guessing.
Subscribe to Nova AI Ops on YouTube for demos, tutorials, and feature deep-dives.
Most reliability tools show a single org-wide score. Service Health Matrix shows the truth: which service, which SLO, by how much.