Alert Noise by Team Attribution

Some teams' alerts are noisier. Attribute and act.

Why per-team attribution matters

Without attribution, alert noise is SRE’s problem; with attribution, it’s the team that wrote the rule. Per-team noise scores reveal which teams generate the most noise per service, and the surprising results are the norm. Attribution drives behavior change faster than any internal training program.

How to attribute

Attribution needs three mechanisms. Tag every rule with a team label as mandatory metadata; aggregate fire counts by team weekly with a leaderboard sorted by signal-to-noise; pull team mappings from the service catalog and route untagged rules to an unowned bucket that SRE actively shrinks.

Metrics to publish

Five metrics tell the story. Total fires per team per week, auto-resolved fires per team, pages per on-call shift per team, cost per page (vendor fees plus interruption time, where some teams generate 10x others), and improvement velocity (week-over-week change in noise score).

Aligning incentives

Aligning incentives makes attribution real. Tie SLO budget to noise budget so the top-quintile noise team loses change-management privileges until the score drops; make new rule creation conditional on existing noise score with a delete-one-add-one rule for high-noise teams; recognise improvement publicly to reinforce the discipline.

Start with a public dashboard

Don’t enforce penalties before publishing data. Visibility alone moves the needle 30%; skip naming-and-shaming language and frame as team improvement; audit attribution accuracy monthly because misattributed noise erodes trust in the whole system.