Alert Priority vs Severity

Two attributes; different. Both matter.

Priority and severity are different axes

Severity is impact: how broken is the service for users. Priority is response order: which incidents do we work on first. Conflating the two is why backlog grooming sessions devolve, because a sev3 can still be P1 for a customer with a contractual deadline.

How to define severity

Severity definitions need to be sharp enough to trigger the right response automatically. Sev1 is revenue-impacting and customer-visible with no workaround; sev2 is degraded but not broken; sev3 is minor or cosmetic. Each tier maps to a specific response posture.

How to define priority

Priority is about scheduling: when does this work happen relative to everything else. P1 cancels other work; P2 fits in this sprint and blocks the next; P3 sits in the backlog and gets reviewed at planning. Three tiers is enough; more fragments the data.

Mapping the two

The matrix surfaces interesting cases. Most sev1 incidents are P1 and most sev3 issues are P3, but sev2/P1 (a workaround exists but the customer pays for fast fix) and sev1/P2 (production broken but contained, post-incident work scheduled) are the cases that need explicit policy.

Standardize before scaling

Standardisation is cheap before scaling and expensive after. Pick definitions before adding more teams; keep three tiers each; audit usage quarterly because tier inflation (everyone tags everything sev1) is the failure mode to watch for.