Recovering From a Saturated On-Call
When the on-call has been pinned for 3+ days, normal recovery does not work. The 5-step protocol for getting the team back to baseline.
The 5 steps
When the on-call has been pinned for three or more days, the team needs a structured recovery protocol. Normal recovery does not work because saturation compounds; the protocol breaks the cycle.
- Step 1: pause non-critical work. Marketing campaigns, feature launches, anything that adds complexity; buys cognitive room.
- Step 2: bring in extra on-call. Additional engineer from another team for 48 hours; lets the burned-out engineer actually rest.
- Step 3: triage the backlog. Some items become 'will not fix'; the rest get owners and dates.
- Step 4: kill the burning fire. Identify the single thing causing repeat alerts; fix it before resuming normal cadence.
- Step 5: resume after a quiet shift. Only after a full quiet shift; premature resumption produces relapse.
Signs you need this protocol
The protocol is not for normal busy weeks. Three signs together signal real saturation; recognising them early prevents the worst outcomes.
- Sleepless nights. Three or more in a week; not just busy, actually unable to sleep through.
- Quality drops. On-call making mistakes the 24-hour-rest version of them would not make.
- Stakeholder doubt. Stakeholders questioning team capability; burned-out teams produce visible quality drops.
- Volunteer attrition. Engineers asking to leave the rotation; the canary in the mine.
Avoid
Three reactions make saturation worse. Each one is the natural human response and each one is exactly the wrong move.
- Heroism. 'I can power through' is how saturated on-calls become quitting on-calls.
- Normalising. Pretending the saturation is the new normal; the data is signal, act on it.
- Blame. Saturation usually has a system cause; finding it is more useful than finding fault.
- Quiet escalation. Telling leadership only when the team breaks; surface earlier so the support arrives in time.