MTTD: The Metric Behind MTTR

MTTR includes detection time. Lowering MTTD often lowers MTTR more than lowering response time.

MTTD

Mean time to detect is the gap between when an issue starts affecting users and when monitoring tells someone about it. In most postmortems it is the largest component of total time-to-resolve, and investment here pays back faster than investment in any other phase.

Improve

MTTD improvement comes from earlier-firing alerts and broader coverage. Wins compound across hundreds of incidents because the same detection latency repeats every time the same class of fault recurs.

Track

Tracking MTTD requires capturing both timestamps for every incident. The gap between them is the optimisation target; without the data, no investment can be justified.