MTTA: Time From Page to Acknowledged
MTTA is response readiness. Shrinking it shrinks MTTR.
Definition
MTTA is time from page-sent to on-call acknowledgement, not from incident-start (which is MTTD plus MTTA). Acknowledge means “I am responding”, not “fixed”: the pager stops re-paging and the team knows someone is on it. Typically measured per page and aggregated per shift, per engineer, per service.
- Page-sent to ack. Not from incident-start; that’s MTTD plus MTTA; the two metrics are distinct.
- Ack means engaged. Not fixed; the pager stops re-paging; the team knows someone is on it.
- Aggregated views. Per page, per shift, per engineer, per service; different views surface different problems.
- Per-team baseline. The team’s own MTTA history is the comparison; absolute numbers vary by domain.
Reasonable targets
Targets scale with severity. Sev 1: under 5 minutes; Sev 2: under 15; Sev 3: under 1 hour during business hours. Aim for 95th-percentile within target rather than median because outliers are where incidents go wrong; per-service tier customer-critical services tighter, internal services looser.
- Sev 1: under 5 minutes. The headline target; the most urgent class.
- Sev 2: under 15 minutes. Less urgent but still time-bounded.
- Sev 3: under 1 hour business hours. Routine work; no off-hours requirement.
- 95th percentile within target. Median is too forgiving; outliers are where incidents go wrong.
How to consistently hit it
Three mechanisms drive consistent MTTA. Multi-channel paging covers phone DND, dead battery, no signal as real failure modes; backup on-call gets paged when primary doesn’t ack within 5 minutes; quarterly synthetic pages verify the chain end-to-end so surprises during real incidents are caught in advance.
- Multi-channel paging. Phone, app, SMS, voice escalation; phone DND, dead battery, no signal are real failure modes.
- Backup on-call. If primary doesn’t ack in 5 minutes, secondary gets paged; tested monthly.
- Quarterly synthetic pages. Verify the chain end-to-end; surprises during real incidents are expensive.
- Per-rotation chain test. The full ack chain tested per rotation; supports rotation-specific issues.
Reading MTTA data
The MTTA trend tells different stories. Trend up means paging tool degrading, rotation understaffed, or on-call fatigue, and deserves investigation; trend down is usually good but watch for ack-without-action and track MTTR separately; the 99th percentile is the worst-case signal often more useful than the median.
- Trend up. Paging tool degrading, rotation understaffed, on-call fatigue; investigate.
- Trend down. Usually good; watch for ack-without-action; track MTTR separately.
- 99th percentile signal. Worst-case response; often more useful than the median.
- Per-week trend dashboard. Trend visible to the team; supports continuous awareness.
When MTTA is bad
Bad MTTA has three causes in order. First: tool reliability, because lost pages mean lost MTTA, and paging tool dashboards show delivery rates. Second: rotation health (burnout, vacation gaps, off-hours coverage), which a survey plus staff-up addresses. Third: process, where routing fixes are usually high-leverage.
- Tool reliability first. Lost pages mean lost MTTA; paging tool dashboards show delivery rates.
- Rotation health second. Burnout, vacation gaps, off-hours coverage; survey, staff up.
- Process third. Right person paged, alert properly classified; routing fixes are usually high-leverage.
- Per-cause investigation playbook. Each cause has a specific check; supports fast diagnosis.