Deploy Rollback Discipline

Rollback should be 1 command.

Speed targets

Rollback speed is customer-impact duration. Every minute the bad version is live is a minute of customer pain; the speed bar should be a published number, not a vibe.

Automation patterns

Three automation patterns cover almost every rollback case: one-command rollback, auto-rollback on metric breach, and human override always available alongside both.

Testing rollback

Quarterly drills are how rollback stays real. The procedure decays without exercise; the first time you run it cannot be the time the customer is paged.

Constraints and trade-offs

Three constraints limit rollback in real systems: schema migrations, external API contracts, and stateful services. Plan accordingly so the deploy that needs to roll back can.

Operating rollback discipline

Rollback discipline is operational, not theoretical. Track the metrics, run the drills, and treat regression rollbacks as inputs to deploy hardening.