CI/CD & GitOps Practical By Samson Tanimawo, PhD Published Nov 14, 2025 4 min read

Deploy Postmortem When It Fails

Failed deploy → postmortem.

Trigger

Most engineering organizations have a postmortem process for incidents. Fewer have one for failed deploys. The two are related but different: an incident postmortem covers the full operational story; a deploy postmortem focuses specifically on what got past the deploy gates and why. Adding deploy postmortems to the practice is one of the highest-leverage CI/CD improvements a team can make.

What should trigger a deploy postmortem:

The trigger is what makes the practice routine. Without an automatic trigger, deploy postmortems happen only when someone remembers to ask for one, which is rarely.

Focus

The deploy postmortem has a specific focus that incident postmortems often miss: the gap between the deploy gates that should have caught the regression and the regression itself. Why did CI pass? Why did canary not catch it? Why did the soak window expire successfully? Each "why" closes a specific gap in the safety net.

The focus on gates makes deploy postmortems different from incident postmortems. The incident postmortem asks "what happened?"; the deploy postmortem asks "why did our deploy safety net fail?"

Action items

The output of every deploy postmortem is action items. Concrete, owned, time-bound. Without action items, the postmortem is a debrief; with them, it is a real safety improvement.

Deploy postmortems are the practice that turns deploy failures from operational pain into systematic improvements. Nova AI Ops auto-opens postmortem tickets on rollback events, links them to the deploy log and the contributing metrics, and tracks the action items so the lessons from each failure produce a permanent improvement to the safety net.