Kill Switch is the panic stop for the AI fleet. One agent acting weird? Pause it. One tenant misbehaving? Quarantine it. Whole platform looking off? Halt everything. Sub-second propagation, three scopes, fully audited. Designed to be safe to press.
Different incidents need different scopes. A misbehaving agent gets paused at the agent scope (everyone else keeps working). A bad tenant deploy gets quarantined at the tenant scope. A platform-wide regression gets the global kill. The button only does what its scope is configured for, so you cannot accidentally halt the world.
When you press kill, every running agent transitions to read-only mode. They can still observe, log, and reason, they just cannot execute any tool that mutates state. In-flight tool calls finish or roll back to a checkpoint. Nothing that was healthy becomes unhealthy because you pressed kill.
Patterns where teams reach for the kill switch: (1) a model upgrade from your provider produced regressions and the fleet is over-acting, (2) you are running a chaos game day and want to take humans-only for an hour, (3) a downstream provider (cloud, monitoring, paging) is degraded and you want to stop acting on stale data.
Pressing kill writes a row to Agent Ledger with the operator id, the scope, the optional reason, and the duration. Releasing kill records the warm-restart sequence. Use the report view to see how often you are pressing each scope and whether you are converging on stability or hitting the same wall every week.
Subscribe to Nova AI Ops on YouTube for demos, tutorials, and feature deep-dives.
Adopting AI for ops feels safer when you know exactly how to halt it. Kill Switch is that lever, and pressing it never breaks anything.