AI Agent Operations

One incident, not eighty pages,
related alerts grouped automatically

Incident Correlation Engine groups related alerts into single incidents. When 80 services start erroring because the database is slow, you get one page about "database degraded," not 80 service pages. The engine uses the service graph, time proximity, and shared root signals to identify related alerts; ungrouping is one click if the engine got it wrong.

Get Started Talk to Sales
app.novaaiops.com / incident-correlation-engine
● LIVE
~80%
noise reduction
Service-graph
aware grouping
Ungroup
one click
Live
as alerts arrive
Three Grouping Signals

Service graph, time, shared symptom

The engine uses three signals to decide if two alerts belong together. (1) Service-graph proximity: alerts on services connected by the live service map are likely related. (2) Time proximity: alerts within 60 seconds of each other on connected services are very likely related. (3) Shared symptom: alerts citing the same upstream cause (database, redis, queue) consolidate.

  • Service-graph proximity: connected services with simultaneous alerts get grouped, graph is the prior
  • Time proximity: within-60s arrival on related services is treated as the same root event
  • Shared symptom: when alerts cite the same upstream (e.g., "rds slow"), they consolidate even across distant services
app.novaaiops.com / incident-correlation-engine · signals
Ungrouping

When the engine is wrong, one click

Sometimes two genuinely separate incidents fire at the same time. The engine may group them; the operator can split. One click ungrouping splits any group into two new incidents and re-routes the on-call accordingly. The split is recorded so the engine can learn from it.

  • One-click split: select alerts to move to a new incident; routing rebuilds automatically
  • Learning loop: splits feed back into the model so the engine gets better over time
  • Audit logged: every split records who split, why, and when, for postmortem review
app.novaaiops.com / incident-correlation-engine · ungroup
Tunable Aggressiveness

Group more or group less, your call

Default tuning is moderate: prefer grouping when the signals are strong. You can dial more aggressive (group at weaker signals, fewer pages, more risk of grouping unrelated incidents) or less aggressive (group only at strong signals, more pages, less risk of bundling). Per-tenant config; ships with sensible defaults.

  • Three modes: aggressive, moderate (default), conservative, tunable per tenant
  • Visible tradeoff: each mode shows expected page count vs split risk on the page
  • A/B-able: try a different mode for a week; the page reports whether grouping precision improved or got worse
app.novaaiops.com / incident-correlation-engine · tuning
Reporting

Noise reduction as a number

Weekly report: alerts fired, incidents formed, noise-reduction percentage, mis-group rate (from operator splits), and trend. Use the report to defend the value of the platform when finance asks "what does Nova actually save us?", pager-fatigue costs are a real number, not just a vibe.

  • Weekly digest: alerts, incidents, reduction %, mis-group rate, emailed Monday
  • Trend: reduction % over time; tightening configs improves it
  • Audit-friendly: every grouping decision is logged; auditors can verify nothing important was bundled away
app.novaaiops.com / incident-correlation-engine · report
Video walkthrough coming soon

Subscribe to Nova AI Ops on YouTube for demos, tutorials, and feature deep-dives.

Stop paging on every alert

Pager fatigue is a major MTTR driver. Correlation pages once per incident, not per alert.

Get Started Request a Demo