Alert History Export

Alert history is data. Export it for analysis.

Why export alert history

Alerting tools keep 30-90 days of history by default, which is too short for trend analysis, postmortem reviews, or audit. Export to long-term storage (BigQuery, Snowflake, S3 plus Athena) with 18-24 months retention as the right floor; the export is the foundation for the cleanup ritual, the on-call survey, and the noise budget.

How to wire the export

The wiring is simple. PagerDuty webhooks fire on incident events; pipe to a Lambda or Cloud Function that writes to a warehouse table. Datadog and Prometheus Alertmanager both support webhook receivers using the same pattern. The schema is stable: incident_id, alert_name, fired_at, acked_at, resolved_at, owner_team, severity, labels.

Retention and access

Retention and access need policy. 18 months minimum (24 months covers full year-over-year with one rollover); encrypt at rest; strip PII at ingest via a deny-list on label names; restrict access to SREs and engineering leads because alert history reveals who burns out and which teams are noisy.

Queries that pay back the work

Three queries pay for the export immediately. Top noisy alerts drive the quarterly cleanup; time-to-ack and time-to-resolve per team drive rotation rebalancing; alerts during deploy windows catch deploy-induced noise that should be silenced or fixed.

Build vs buy

The build-vs-buy decision is data-driven. PagerDuty Insights and Datadog Watchdog cover the basics; use them until you outgrow the queries they support. Build a custom warehouse only when you need cross-tool analysis or labels they don’t expose; the queries you cannot run today are the justification.