Network Monitoring: The Five Numbers

Network monitoring is often visible only to the network team. SREs benefit when these five numbers are visible to them too.

Why five

Network failures rarely look like network failures from the application's point of view. Five wire-level metrics catch most of what app metrics miss.

The five metrics

Dashboard pattern

One dashboard per service tells the network story at a glance: five panels, one for each metric, plus drill-downs for the per-dependency view.
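As a rough illustration of that layout, the sketch below builds a tool-agnostic dashboard description: one panel per metric, each carrying one drill-down per dependency. The function name `build_dashboard`, the dictionary shape, and the placeholder metric and dependency names are all assumptions made for this example; they do not correspond to any particular dashboard product's schema.

```python
from typing import Dict, List


def build_dashboard(service: str, metrics: List[str], dependencies: List[str]) -> Dict:
    """Describe one dashboard for one service (hypothetical structure).

    Each of the five metrics gets a panel, and each panel gets one
    drill-down entry per downstream dependency.
    """
    if len(metrics) != 5:
        raise ValueError("expected exactly five metrics, one per panel")

    panels = []
    for metric in metrics:
        panels.append({
            "title": f"{service}: {metric}",
            "metric": metric,
            # Per-dependency view of the same metric.
            "drilldowns": [
                {"dependency": dep, "metric": metric} for dep in dependencies
            ],
        })
    return {"service": service, "panels": panels}


if __name__ == "__main__":
    # Placeholder names only; substitute your own five metrics.
    placeholder_metrics = ["metric_1", "metric_2", "metric_3", "metric_4", "metric_5"]
    dashboard = build_dashboard("checkout", placeholder_metrics, ["db", "cache"])
    print(len(dashboard["panels"]), "panels")
```

Keeping the description as plain data makes it easy to generate the same five-panel layout for every service and translate it into whatever dashboard format is in use.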

Alert thresholds

Thresholds depend on baseline. Hard-coded numbers are starting points; tune them once you have a week of data.
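One way to tune a threshold from a week of data is to derive it from the observed baseline. The sketch below uses the mean plus a multiple of the standard deviation, which is a common heuristic rather than a rule taken from this article. The function name `baseline_threshold`, the multiplier `k`, and the optional `floor` (a hard-coded starting point that the tuned value should not drop below) are assumptions for illustration.

```python
import statistics
from typing import Optional, Sequence


def baseline_threshold(samples: Sequence[float], k: float = 3.0,
                       floor: Optional[float] = None) -> float:
    """Return mean + k * sample standard deviation of the baseline.

    samples: one value per measurement interval, e.g. a week of readings.
    k:       how many standard deviations above the mean should alert.
    floor:   optional minimum, so a very quiet baseline does not
             produce an overly sensitive alert.
    """
    if len(samples) < 2:
        raise ValueError("need at least two baseline samples")

    mean = statistics.fmean(samples)
    stdev = statistics.stdev(samples)
    threshold = mean + k * stdev

    if floor is not None:
        threshold = max(threshold, floor)
    return threshold


if __name__ == "__main__":
    week_of_readings = [1.0, 2.0, 3.0, 4.0, 5.0]  # illustrative values only
    print(baseline_threshold(week_of_readings, k=2.0))
```

Metrics with heavy-tailed distributions may be better served by a high percentile of the baseline than by a mean-and-deviation rule; either way, the hard-coded number remains only the starting point.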

Antipatterns

What to do this week

Three moves. (1) Apply this pattern to your highest-risk network path. (2) Measure how often the failure mode occurs before and after the change. (3) Document the change so the next incident responder inherits the knowledge.
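For the second move, a before/after comparison can be as simple as two rates and their relative change. The sketch below assumes you can count failed and total events (requests, connections, or probes) over two comparable windows; the function names and the sample counts are hypothetical.

```python
def failure_rate(failures: int, total: int) -> float:
    """Fraction of events in a window that hit the failure mode."""
    if total <= 0:
        raise ValueError("total must be positive")
    if failures < 0 or failures > total:
        raise ValueError("failures must be between 0 and total")
    return failures / total


def relative_change(before: float, after: float) -> float:
    """Relative change from before to after; negative means improvement."""
    if before == 0:
        raise ValueError("cannot compute relative change from a zero baseline")
    return (after - before) / before


if __name__ == "__main__":
    # Illustrative counts only, not measured data.
    before = failure_rate(failures=5, total=1000)
    after = failure_rate(failures=2, total=1000)
    print(f"before={before:.4f} after={after:.4f} "
          f"change={relative_change(before, after):+.0%}")
```

Use windows of similar length and traffic mix on both sides of the change, otherwise the comparison mostly reflects load differences.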