Read Replicas Strategy

Use; offload; replication lag.

Overview

Read replicas scale reads horizontally by accepting that some queries can tolerate slightly stale data. The catch is replication lag: replicas are eventually consistent with the primary, lag is usually milliseconds but spikes during high write load, and reading a replica immediately after writing to the primary returns the pre-write state. The discipline is matching each query to the consistency it actually needs and routing accordingly, with explicit fallback for the queries that cannot tolerate lag.

The approach

The practical approach is per-query routing decisions made at the application layer, lag-aware routing that excludes lagging replicas from the rotation, read-after-write fallback for the user-affecting subset, per-replica lag monitoring that alerts when lag exceeds tolerance, and a documented per-database routing strategy committed to the repo so the model is reviewable.

Why this compounds

Read replica discipline compounds across queries and quarters. Each correctly-routed read offloads the primary; the read tier scales linearly with replica count; user-visible correctness stays intact because the routing respects per-query consistency requirements. After a year, the team has a vocabulary for read-write routing that pays off on every new feature.

Read replica discipline is a database discipline that pays off across years. Nova AI Ops integrates with database telemetry, surfaces lag patterns, and supports the team’s read-routing discipline.