PagerDuty Routing Rules: The Hard Cases

Routing alerts to the right team. The hard cases and the patterns.

The easy cases

The easy cases work natively in PagerDuty. Service to team, severity to escalation policy, business hours vs after hours; PagerDuty handles these with event rules. Most teams stop here because the catalog is small and the rules are clear; trouble starts when business and topology force exceptions.

Cross-team services

Cross-team services break the simple model. Service owned by team A but rules into a feature owned by team B; a page on the feature should hit team B, a page on the platform should hit team A. Use PagerDuty event orchestration with custom fields where the alert payload tags the feature and rules route accordingly; document the routing decision in the runbook.

Time-of-day routing

Time-of-day routing supports follow-the-sun coverage. Pages route to APAC overnight, EU during European day, US during American day; PagerDuty schedules support this and event orchestration can override per service; test the boundaries because the 06:00 UTC handover is where routing bugs live.

Vendor and third-party pages

Vendor pages are signals, not pages for most services. AWS Health, Cloudflare incidents, GitHub status; route to a Slack channel by default and page only if the affected service is tier 1; use Statuspage’s API to fan in vendor incidents to your alerting backbone and then apply your own routing logic.

Apply this quarter

The application is concrete. Audit your event rules because anything older than 6 months without a recent edit is suspect; document each non-trivial rule in a comment field for the next person to touch it; run a synthetic page test monthly because routing breaks silently and only synthetic tests catch it.