Database Monitoring 2026
Slow queries; locks; vacuum.
Overview
Database monitoring in 2026 captures the right database signals to drive operations and investigation. CPU and memory are the easy metrics; slow queries, locks, vacuum lag, and replication lag are what determine whether the database is actually healthy.
- Slow queries, locks, vacuum. Per-database key signals; surface the database-specific failure modes that CPU does not predict.
- Per-query latency. Per-query p99 latency; supports investigation; identifies the hot query before users complain.
- Lock waits. Per-database lock wait time; the metric that warns before deadlocks become incidents.
- Vacuum plus replication lag. Per-table vacuum lag for Postgres; per-replica replication lag for HA.
The approach
The practical approach: per-query latency tracking, lock-wait monitoring, replication-lag dashboards, vacuum activity for Postgres, documented per-database dashboard. The team’s discipline produces healthy databases instead of mysterious slowdowns.
- Per-query tracking.
pg_stat_statements, MySQL Performance Schema; the source of truth for query-level signal. - Lock monitoring. Per-database lock wait time; deadlock and contention surface before user-visible impact.
- Replication lag. Per-replica lag; matches HA; the failover decision needs current lag data.
- Vacuum activity. Per-table vacuum lag; Postgres-specific; bloat accumulates without it.
- Document the dashboard. Per-database dashboard committed to the repo; supports operational reviews.
Why this compounds
Database monitoring discipline compounds across databases. Each captured signal supports investigation; the team’s database expertise grows; cost-per-incident falls as the dashboards mature.
- Faster investigation. Right signals support fast root cause; the slow query is identified before the alert escalates.
- Better performance. Per-query tracking supports optimisation; the hot query gets tuned before it becomes incident.
- Better operational fit. Right signals match the workload; OLTP and OLAP need different dashboards.
- Institutional knowledge. Each signal teaches database patterns; the team’s database engineering muscle grows.
Database monitoring discipline is an operational discipline that pays off across years. Nova AI Ops integrates with database telemetry, surfaces patterns, and supports the team’s database engineering discipline.