Reverting Agent Actions: The Undo Strategy You Need

Agents make mistakes. This article covers the undo store, the reversibility classifier, and the human escalation path for actions that cannot be undone automatically.

Classify actions by reversibility

Reversibility is the property that decides everything else. Three classes cover the cases that occur in practice, and each calls for a different recovery strategy.
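
A classifier of this kind could be sketched in Python as a simple lookup table. Only Class C (actions that cannot be undone) is described in this article; the meanings given to classes A and B below, the action names, and the choice to treat unknown actions as Class C are all assumptions made for illustration.

```python
from enum import Enum


class Reversibility(Enum):
    # Meanings of A and B are assumed for this sketch; C follows the article.
    A = "undone automatically from recorded state"
    B = "undone only by a compensating action"
    C = "cannot be undone; escalate to a human"


# Hypothetical action names; a real system would maintain its own table.
ACTION_CLASSES = {
    "update_record": Reversibility.A,
    "charge_card": Reversibility.B,
    "send_email": Reversibility.C,
}


def classify(action_name: str) -> Reversibility:
    """Look up an action's class, falling back to C when it is unknown."""
    return ACTION_CLASSES.get(action_name, Reversibility.C)
```

Falling back to Class C is a conservative choice: an action nobody has classified is treated as irreversible until someone says otherwise.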

The undo store

The undo store is the artefact that makes revert real. Without it, “undo” is wishful thinking.
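
One possible shape for such a store, as a minimal in-memory sketch: each action is recorded together with a callable that reverses it. The names `UndoStore`, `record`, `revert`, and `revert_all` are hypothetical, and a production store would presumably need durable storage rather than a Python dict.

```python
from typing import Callable, Dict, List


class UndoStore:
    """In-memory record of actions and the callables that reverse them."""

    def __init__(self) -> None:
        self._inverses: Dict[str, Callable[[], None]] = {}
        self._order: List[str] = []   # action ids in the order recorded
        self._reverted: set = set()

    def record(self, action_id: str, inverse: Callable[[], None]) -> None:
        """Store the inverse for an action that has just been performed."""
        if action_id in self._inverses:
            raise ValueError(f"duplicate action id: {action_id}")
        self._inverses[action_id] = inverse
        self._order.append(action_id)

    def revert(self, action_id: str) -> bool:
        """Run the inverse once; return False if it was already reverted."""
        if action_id in self._reverted:
            return False
        self._inverses[action_id]()   # raises KeyError for unknown ids
        self._reverted.add(action_id)
        return True

    def revert_all(self) -> List[str]:
        """Revert every pending action, newest first; return ids reverted."""
        return [a for a in reversed(self._order) if self.revert(a)]
```

The inverse has to capture the prior state at the moment the action runs, for example `store.record("a1", lambda: doc.update(title=old_title))` where `old_title` was read before the change. Reverting newest first keeps later actions from being undone on top of state that earlier reverts have already changed.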

The revert UI

The UI matters because reverts happen under stress. Three properties keep the operator from making the wrong move when seconds matter.

Class C and the escalation path

Class C actions cannot be undone. Recovery is human-led; the agent’s job is to hand the human the right context, fast.
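
What "the right context" contains is not specified here, so the following is only a sketch of one plausible handoff: a small structured packet the agent fills in and serialises for whoever is on call. The field names and the `build_escalation` helper are hypothetical.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime, timezone
from typing import List


@dataclass
class EscalationPacket:
    # All field names are assumptions about what a human responder needs.
    action_id: str
    action_name: str
    what_happened: str
    affected_resources: List[str]
    suggested_next_steps: List[str]
    created_at: str


def build_escalation(action_id: str, action_name: str, what_happened: str,
                     affected_resources: List[str],
                     suggested_next_steps: List[str]) -> str:
    """Return a JSON handoff for a Class C action, stamped with UTC time."""
    packet = EscalationPacket(
        action_id=action_id,
        action_name=action_name,
        what_happened=what_happened,
        affected_resources=affected_resources,
        suggested_next_steps=suggested_next_steps,
        created_at=datetime.now(timezone.utc).isoformat(),
    )
    return json.dumps(asdict(packet), indent=2)
```

Keeping the packet as plain JSON means it can be dropped into a ticket, a chat message, or a pager alert without the human needing access to the agent's internals.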

Limits of revert as a safety strategy

Revert is one safety mechanism, not the safety mechanism. Three limits show why it cannot stand alone.