Best Practices Intermediate By Samson Tanimawo, PhD Published Sep 30, 2025 6 min read

Anti-Burnout Practices for SRE Teams That Actually Work

Burnout in SRE is structural, not personal. The team that runs the same on-call rotation for two years without changing it produces burned-out engineers, regardless of the engineers' resilience.

Burnout is structural, not personal

The conversation usually frames burnout as something individual engineers fail at. The truth is that the rotation, the page volume, and the org's response to incidents are the variables that produce burnout, and they are knobs leadership controls. Treating it as personal is leadership avoiding the work of changing the structure.

The structural reframe matters. Personal-burnout framing leads to "wellness programs" — meditation apps, mindfulness training, vacation reminders. These don't reduce burnout because they don't change the underlying load. Structural framing leads to changing rotation, alert volume, and recovery time — which actually move the needle.

The leadership accountability. Burnout is a leadership signal. A team experiencing burnout is being run with too few people, too much noise, or too little recovery. Each is something leadership can change. Reframing as structural makes the conversation actionable.

Measure before resignations

Three metrics. Page volume per shift. Pager-acks outside business hours. Voluntary attrition on the team versus comparable teams. All three move months before the resignations land. Watch them.

The leading-indicator framing. Resignations are lagging indicators — by the time someone quits, they've been disengaged for months. Leading indicators (page volume, after-hours acks, voluntary swaps decreasing) appear 3-6 months earlier. The team that watches leading indicators can intervene; the team that watches resignations is reacting too late.

The dashboard. A simple monthly review: page volume per engineer per shift, % of pages outside business hours, voluntary shift swaps (decreasing means trouble). The metrics are easy to compute; the discipline of watching them is what matters.

Five structural levers

Rotation length, page volume, comp time, scope shift, sabbatical. Each one moves the burnout dial.

The five levers' independence. Each operates on a different mechanism. Rotation length affects recovery cadence. Page volume affects acute load. Comp time affects recovery from bad shifts. Scope shift affects autonomy. Sabbatical affects long-term restoration. A team needing intervention typically benefits from multiple levers simultaneously.

Rotation length

One-week rotations are gentler than two-week ones. Two-week rotations are gentler than month-long ones. Cost: more handoffs. Worth it; handoffs are routine, recovery from burnout is not.

The math. A one-week shift gives the engineer a 1-week-on / 5-week-off (or 7-week-off in larger rotations) cadence. Recovery is fast. Two-week shifts are 2-on / 6-off. Engineers absorb more during the on period and need more recovery; the cumulative load is higher despite the same total time.

The handoff cost objection. "We do 2-week rotations to amortise handoff cost." Handoffs are 60 seconds. The handoff cost is genuinely small; the recovery benefit of shorter rotations is large. The objection is rationalisation, not real economics.

Page volume per shift

Aim for under 5 pages per week per on-call. Above 10 and the team is in slow burnout. Above 20 and the team is in fast burnout, regardless of how stoic everyone sounds in standup.

The volume's compound effect. 10 pages/week × 2-hour-context-switch each = 20 hours/week of disruption. The on-caller is officially on-call for 168 hours; 20 hours of those are interrupted focus. Compound across a 2-week rotation — that's 40 hours of disrupted focus, more than a full work week.

The reduction work. Every alert review (covered separately) directly reduces this number. A team at 12 pages/week that runs an alert-quality sprint can drop to 5; the on-callers feel the difference immediately.

Comp time

If a shift is bad, the engineer takes a paid day off. Do not negotiate. The cost is small; the message is large; the engineer recovers instead of accruing.

The pattern. After a shift with 15+ pages, the engineer takes the next Friday off. Not "uses PTO" — paid recovery. The cost to the company is one engineer-day; the cost of NOT giving it is a slow erosion of engagement.

The signaling effect. Comp time visibly says "leadership knows that bad shifts are bad and is willing to spend on recovery." Without it, engineers feel like they're absorbing the cost of the team's reliability problems personally. The recovery message changes the dynamic.

Scope shift

After a bad rotation, the engineer's next sprint is non-on-call work. They stop doing the sprint as scheduled and pick up automation work that reduces the page volume that just hurt them. The team gets healthier; the engineer feels like the system listened.

The two-fer. Bad rotation produces an engineer who: (a) needs recovery, (b) has acute knowledge of what alerts were noisy. Scope shift addresses both — they recover by working on automation that fixes the noise. The next rotation is calmer; the next engineer benefits.

The leadership commitment. Scope shift requires leadership to defend the engineer's sprint commitment changing. "Sara was supposed to ship Feature X this sprint; now she's improving alert quality. Feature X slips a week." Leadership has to back this; otherwise scope shift becomes optional and falls apart.

Sabbatical

One month off after every 3 years on rotation. Not negotiable. The cost is one month of one engineer per 3 years; the savings are not having to re-hire that engineer in year 4.

The economics. A senior SRE who leaves costs ~$300k to replace (recruiting, ramp time, lost productivity). A one-month sabbatical costs ~$25k. The replacement cost is 12x the sabbatical cost. If the sabbatical prevents even occasional resignations, the math is clear.

The cultural signal. Companies that offer sabbaticals attract senior engineers. Companies that don't lose them. The benefit is a hiring advantage, not just a retention tool.

Two anti-patterns that look like care

"Mental health days" with no follow-on change to the rotation: feels caring, changes nothing. Mandatory wellness training: feels caring, changes nothing. Both are theatre; neither moves any of the structural levers above.

The pathology. Leadership wants to address burnout; chooses interventions that are visible but don't change load. The team experiences these as performative — leadership is checking a box rather than addressing the cause. Trust erodes.

The counter. Pair any wellness intervention with a structural change. "We're offering mental health days AND we're cutting alert volume by 30% this quarter." The combination signals real intent; the structural change is what actually helps.

What to do this week

Three moves. (1) Compute your team's pages/week/engineer for the last 90 days. The number reveals which band you're in. (2) Pick the single highest-leverage structural lever for your team. If page volume is high, attack that. If rotation length is too long, propose shortening. If recovery isn't happening, institute comp time. (3) Surface burnout as a structural conversation with leadership. The conversation framing — structural not personal — is what unlocks the structural responses.