Auto Remediation & Runbooks

954 runbooks that fix incidents
while your team sleeps

Nova Auto Remediation ships with 954 pre-built runbooks covering the most common infrastructure and application failures. When a known pattern is detected, a memory leak, a stuck queue, a certificate expiring, Nova executes the matching runbook automatically. Rollbacks, scaling actions, and service restarts happen in under 90 seconds, and every action is logged in a full audit trail.

Start Free Trial Watch Demo
app.novaaiops.com · Auto Remediation
● LIVE
Nova AI Auto Remediation
954
Pre-built runbooks
78%
Auto-resolved incidents
<90s
Mean time to remediate
Safe
What-if simulation
Pre-Built Runbooks

954 runbooks for every failure pattern you will encounter

Nova ships with a library of 954 runbooks built from real-world incident patterns across thousands of production environments. Each runbook defines the detection trigger, the remediation steps, validation checks, and rollback procedures. Runbooks cover Kubernetes pod restarts, database connection pool exhaustion, disk space cleanup, certificate renewal, memory leak mitigation, and hundreds more.

  • Pattern matching: runbooks trigger automatically when Nova detects a matching failure signature across your telemetry
  • Custom runbooks: extend the library with your own runbooks using a visual editor or YAML definitions
  • Version controlled: every runbook change is versioned and auditable, with approval workflows for production runbooks
app.novaaiops.com · Runbook Library
Pre-built runbooks
Auto-Execute & Rollback

Fix it automatically, roll it back if anything goes wrong

When a runbook triggers, Nova executes the remediation steps in sequence: diagnose, act, validate, close. If validation fails, say the service does not recover after a restart, Nova automatically rolls back and escalates to a human. Every action is executed in a sandboxed context with full rollback capability, so auto-remediation is safe even for production-critical services.

  • Automatic rollbacks: if remediation does not resolve the issue, Nova reverts all changes and escalates immediately
  • Scaling actions: auto-scale Kubernetes deployments, EC2 instances, or Lambda concurrency in response to load anomalies
  • Service restarts: safely restart unhealthy pods, containers, or services with health-check validation before marking resolved
app.novaaiops.com · Auto-Execute
Auto-execute and rollback
What-If Simulation

Test remediation before it touches production

Before enabling auto-execute on a runbook, use what-if simulation to see exactly what would happen. Nova replays recent incidents through the runbook logic and shows you which steps would have executed, what resources would have been modified, and whether the issue would have been resolved. Build confidence in your runbooks before letting them run autonomously.

  • Incident replay: replay any past incident through a runbook to see if it would have been caught and resolved
  • Dry-run mode: execute runbooks in dry-run mode that logs every action without actually modifying any resources
  • Blast radius preview: see which services and resources a runbook will touch before enabling auto-execution
app.novaaiops.com · What-If Simulation
What-if simulation
Video Walkthrough

See Auto Remediation in action

Video walkthrough coming soon. Subscribe to our YouTube channel to get notified when it drops.

Subscribe on YouTube

Let AI fix incidents while your team focuses on building

954 pre-built runbooks, automatic execution on known patterns, safe rollbacks, and what-if simulation for production confidence.

Start Free Trial Request a Demo