Alert vs Dashboard Decision

Some signals belong on dashboards, not in alerts.

The decision rule

The decision is action-driven. Alert if customer impact is happening or imminent, time-sensitive action is required, and someone needs to act now; dashboard if trending data, aggregate metrics, situational awareness, or post-hoc analysis (data informs decisions but doesn’t demand immediate action). Mixing the two creates fatigue: dashboards full of pageable signals get ignored, pages that should have been dashboards burn out the on-call.

Strict criteria for alerts

Three criteria must all hold for a signal to be an alert. Customer impact (real or imminent; signals with no customer connection like CPU at 80% are dashboards, not pages); action exists (an alert without a runbook is a notification of helplessness, find an action or move it to a dashboard); time-sensitive (if the action can wait until business hours, the alert can wait).

Dashboard criteria

Three categories belong on dashboards. Trends and aggregates (week-over-week, month-over-month, capacity planning, SLO burn-down); operational awareness (on-call checks at start of shift, during incidents dashboards inform but don’t drive paging); stakeholder reports (business metrics, customer counts, revenue, audience is decision makers).

Converting between them

Conversion goes both ways. Frequently-firing alerts that operators dismiss without action are dashboard candidates (track per-alert action rate, below 50% means not an alert); dashboard panels that surface real problems people only see in postmortems are alert candidates (convert when the pattern repeats); quarterly review of both directions because each conversion is a small win and the cumulative effect is significant alert quality.

Anti-patterns

Three anti-patterns survive too long. Dashboards full of red panels nobody investigates (dashboards are not alerts; visual urgency creates anxiety without action); alerts that exist for reassurance (“alert if too quiet”) without clear meaning (define what “too quiet” means and what to do, or remove); both alert and dashboard for the same signal (pick one based on the action, or ensure they have different audiences and clear ownership).