Rogers Canada 2022 Outage
Network failure.
Overview
On 8 July 2022 a Rogers Communications maintenance window cascaded into a nationwide telecom outage that took out internet, mobile, payment terminals, and 911 across much of Canada for most of the day. The lasting lesson is not the specific BGP misconfiguration; it is what happens when an entire country's payment, emergency, and connectivity layers all sit on a single telecom vendor.
- Maintenance-triggered cascade. A routing-policy change during planned maintenance propagated and broke the production network. Change-window risk is real.
- Single-vendor concentration. Banks, payment processors, and emergency services routed through Rogers exclusively. The blast radius matched the concentration.
- Critical-service impact. Interac payment terminals down, 911 access degraded, hospital systems affected. The dependency chain reached deeper than anyone had modelled.
- Recovery complexity plus regulatory response. Multi-hour restoration because rollback itself depended on the broken control plane; CRTC and government scrutiny followed.
The approach
Three habits make services resilient against single-vendor telecom failure: multi-vendor redundancy on critical paths, named failover for emergency-class functions, and documented supply-chain mapping so the next vendor outage is not the discovery moment.
- Multi-vendor redundancy. Critical paths route through two telecom providers. Cost is real; the alternative is being part of the next national-scale outage.
- Critical-service failover. 911, payment, and healthcare links have named secondary providers and tested switchover. The runbook is rehearsed.
- Documented supply-chain map. Per-service the upstream telecom dependencies. Discovery during the outage is too late.
- Game-day plus shared postmortem. Single-vendor failure is exercised; industry-wide postmortems published so lessons travel.
Why this compounds
The Rogers postmortem reshaped supply-chain thinking across Canadian critical-infrastructure operators. Each architecture review that applies the lesson reduces concentration risk a little more; the cumulative effect across the industry is meaningful.
- Single-vendor risk reduced. Multi-vendor architecture survives the kind of provider outage that would otherwise be terminal.
- Incident response stronger. Tested failover paths cut the outage minutes when a vendor degrades.
- Supply-chain awareness grows. The discipline becomes part of architecture review, not an afterthought.
- Year-one investment, year-two habit. The first multi-vendor design is heavy lift. By the third, vendor concentration is part of the standard review checklist.