Pager Load Balancing Across Services
Some services page far more than others. Distribute that load across engineers.
Rotate engineers across services
Engineers rotate through services within their domain. Cadence: quarterly, or triggered by incident volume.
Spreads expertise. Multiple engineers know each service.
Spreads load. Same person doesn't always carry the noisy service.
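A minimal sketch of a quarterly rotation, assuming a simple round-robin shift. The function name, engineer names, and service names are hypothetical; a real schedule would also respect domain boundaries and availability.

```python
def quarterly_assignments(engineers, services, quarters):
    """Round-robin sketch: each quarter, every engineer moves to the next service."""
    schedule = []
    for quarter in range(quarters):
        assignment = {service: [] for service in services}
        for index, engineer in enumerate(engineers):
            # Offsetting by the quarter number shifts everyone one service over.
            service = services[(index + quarter) % len(services)]
            assignment[service].append(engineer)
        schedule.append(assignment)
    return schedule


# Hypothetical team and services, for illustration only.
schedule = quarterly_assignments(
    engineers=["ana", "bo", "chen", "dee"],
    services=["checkout", "search"],
    quarters=2,
)
# Quarter 1: checkout -> ana, chen; search -> bo, dee
# Quarter 2: checkout -> bo, dee;  search -> ana, chen
```

Over enough quarters every engineer sees every service, so no one person stays on the noisy one.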
Match staffing to volume
Noisy services get larger rotations. 8-10 engineers if pages are frequent.
Quiet services share one rotation across several services. Three-engineer rotations are sustainable for low-volume services.
Cross-service backup. Engineers from quieter services back up busier ones during peaks.
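The sizing rule can be sketched as follows. The threshold of 10 pages per week is an assumption, not a figure from these notes; the sizes 8 and 3 take the lower bound of the 8-10 range and the three-engineer rotation mentioned above. Function and service names are hypothetical.

```python
def plan_rotations(weekly_pages, noisy_threshold=10):
    """Give noisy services a dedicated rotation; pool quiet ones into a shared rotation.

    weekly_pages: dict of service name -> average pages per week.
    noisy_threshold: assumed cutoff for "frequent" paging; tune to your own data.
    """
    plan = {"dedicated": {}, "shared": {"services": [], "size": 0}}
    for service, pages in sorted(weekly_pages.items()):
        if pages >= noisy_threshold:
            plan["dedicated"][service] = 8  # lower bound of the 8-10 range
        else:
            plan["shared"]["services"].append(service)
    if plan["shared"]["services"]:
        plan["shared"]["size"] = 3  # one small rotation covers all quiet services
    return plan


# Hypothetical volumes, for illustration only.
plan = plan_rotations({"checkout": 25, "docs": 1, "billing": 3})
```

Here checkout gets its own rotation of 8, while billing and docs share a single rotation of 3.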
Quarterly review
Track per-engineer pages-per-shift across services. Imbalances surface in the comparison.
Investigation: is the load real or alert noise? Often the answer drives tuning, not rotation changes.
Don't just rotate harder. Address underlying causes; tune alerts; fix flapping services.
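A rough sketch of the review numbers, assuming shift records are available as (engineer, service, pages) tuples and each page carries an actionable flag. The 2x-median cutoff is an arbitrary assumption; all names and data are hypothetical.

```python
from collections import defaultdict
from statistics import median


def pages_per_shift(shifts):
    """shifts: iterable of (engineer, service, pages) tuples, one tuple per shift."""
    totals = defaultdict(lambda: [0, 0])  # engineer -> [total pages, shift count]
    for engineer, _service, pages in shifts:
        totals[engineer][0] += pages
        totals[engineer][1] += 1
    return {eng: pages / count for eng, (pages, count) in totals.items()}


def flag_imbalances(rates, factor=2.0):
    """Engineers whose rate exceeds factor x the median rate (assumed cutoff)."""
    baseline = median(rates.values())
    return sorted(eng for eng, rate in rates.items() if rate > factor * baseline)


def noise_share(actionable_flags):
    """Fraction of pages that needed no action; high values suggest tuning alerts."""
    if not actionable_flags:
        return 0.0
    return actionable_flags.count(False) / len(actionable_flags)


# Hypothetical quarter of shift data.
shifts = [
    ("ana", "checkout", 12), ("ana", "checkout", 10),
    ("bo", "search", 2), ("bo", "search", 4),
    ("chen", "docs", 1), ("chen", "docs", 3),
]
rates = pages_per_shift(shifts)    # ana 11.0, bo 3.0, chen 2.0
flagged = flag_imbalances(rates)   # ["ana"]
```

If a flagged engineer's pages are mostly non-actionable, the fix is alert tuning rather than a rotation change.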
Compensate for outliers
Engineers carrying extra load deserve recognition. Compensation: time off, stipends, public credit.
Geographic and time-zone effects. Engineers whose local nights overlap peak page times get more off-hours pages; compensate or rotate.
Don't ignore the data. Pages-per-engineer trends inform retention; bad rotations drive departures.
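Off-hours load per engineer can be counted roughly like this. The sketch assumes page timestamps are timezone-aware UTC datetimes, uses fixed UTC offsets for simplicity (a real version would use zoneinfo to handle daylight saving), and treats 09:00-18:00 on weekdays as working hours. All names and data are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def is_off_hours(page_utc, utc_offset_hours, day_start=9, day_end=18):
    """True if the page lands on a weekend or outside assumed local working hours."""
    local = page_utc.astimezone(timezone(timedelta(hours=utc_offset_hours)))
    return local.weekday() >= 5 or not (day_start <= local.hour < day_end)


def off_hours_counts(pages, offsets):
    """pages: list of (engineer, aware UTC datetime); offsets: engineer -> UTC offset."""
    counts = {engineer: 0 for engineer in offsets}
    for engineer, page_utc in pages:
        if is_off_hours(page_utc, offsets[engineer]):
            counts[engineer] += 1
    return counts


# Hypothetical pages on Tuesday 2024-03-05 (UTC).
utc = timezone.utc
pages = [
    ("ana", datetime(2024, 3, 5, 14, 0, tzinfo=utc)),  # 15:00 local, working hours
    ("bo", datetime(2024, 3, 5, 14, 0, tzinfo=utc)),   # 06:00 local, off-hours
    ("bo", datetime(2024, 3, 5, 10, 0, tzinfo=utc)),   # 02:00 local, off-hours
    ("ana", datetime(2024, 3, 5, 22, 0, tzinfo=utc)),  # 23:00 local, off-hours
]
counts = off_hours_counts(pages, offsets={"ana": 1, "bo": -8})
```

Tracked quarter over quarter, these counts show who is absorbing the night pages and whether compensation or a rotation change is due.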