Cross-AZ Latency Patterns

Typically sub-millisecond, but neither zero nor free.

Overview

Cross-AZ latency in modern AWS regions is typically sub-millisecond, but it is not zero, and traffic between AZs is billed where intra-AZ traffic is often free. The discipline is to design with both facts in mind: co-locate latency-sensitive stateful pairs (such as a database and its primary writer) in the same AZ, replicate stateless services across AZs for resilience, and track cross-AZ bandwidth as a cost line item.
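The co-location rule can be checked mechanically once placements are written down. The following is a minimal sketch, assuming a hand-maintained mapping of service to AZ; the service names, AZ names, and the split_pairs helper are hypothetical and do not come from any AWS API.

```python
# Hypothetical placement record: service name -> AZ it runs in.
placement = {
    "orders-db": "us-east-1a",
    "orders-writer": "us-east-1b",
    "web": "us-east-1a",
}

# Hypothetical list of latency-sensitive stateful pairs that should share an AZ.
pairs = [("orders-db", "orders-writer")]


def split_pairs(placement, pairs):
    """Return the pairs whose two members are placed in different AZs."""
    return [(a, b) for a, b in pairs if placement[a] != placement[b]]


for a, b in split_pairs(placement, pairs):
    print(f"{a} ({placement[a]}) and {b} ({placement[b]}) are in different AZs")
```

A check like this could run in review or CI so that a split pair is flagged before it reaches production.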

The approach

In practice, that means five habits: keep the database and its primary writer in the same AZ so the write path avoids a cross-AZ hop; replicate stateless services across AZs for resilience; use AZ-aware load balancing where the load balancer supports same-AZ preference; track cross-AZ bandwidth as a cost line item; and document per-service AZ placement so the topology is reviewable.
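Same-AZ preference can be illustrated with a small client-side selection routine. This is a sketch of the idea only, not how any particular load balancer implements it; the Backend type, the pick_backend function, and the host and AZ names are all hypothetical.

```python
import random
from typing import NamedTuple, Sequence


class Backend(NamedTuple):
    host: str
    az: str
    healthy: bool = True


def pick_backend(backends: Sequence[Backend], local_az: str, rng=random) -> Backend:
    """Prefer a healthy backend in the caller's AZ; otherwise use any healthy one."""
    healthy = [b for b in backends if b.healthy]
    if not healthy:
        raise RuntimeError("no healthy backends")
    same_az = [b for b in healthy if b.az == local_az]
    # Fall back to other AZs only when the local AZ has no healthy backend,
    # so resilience is kept while cross-AZ hops are avoided in the common case.
    return rng.choice(same_az or healthy)


backends = [
    Backend("10.0.1.10", "us-east-1a"),
    Backend("10.0.2.10", "us-east-1b"),
]
print(pick_backend(backends, local_az="us-east-1a").host)
```

The fallback matters: strict same-AZ routing without it would trade resilience for latency, which is the opposite of what replicating across AZs is meant to achieve.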

Why this compounds

Cross-AZ awareness compounds across services. Each correct placement preserves write-path latency, each AZ-aware load balancing decision saves cross-AZ bandwidth, and the team builds intuition for AZ topology that pays off on every new service.
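The bandwidth saving can be put into rough numbers. The sketch below is back-of-the-envelope arithmetic under stated assumptions: the per-GB price is a placeholder to be replaced with the current rate for your region, the traffic volume is invented for illustration, and the two-thirds figure assumes three AZs with requests spread evenly and no AZ preference.

```python
def monthly_cross_az_cost(total_gb_per_month, cross_az_fraction, price_per_gb):
    """Estimated monthly charge for the share of traffic that crosses AZs."""
    return total_gb_per_month * cross_az_fraction * price_per_gb


ASSUMED_PRICE_PER_GB = 0.02   # placeholder, not a quoted rate
TRAFFIC_GB = 50_000           # hypothetical service-to-service volume per month

# With three AZs and uniformly random routing, 2 of every 3 requests leave the AZ.
before = monthly_cross_az_cost(TRAFFIC_GB, 2 / 3, ASSUMED_PRICE_PER_GB)
# Assumed residual cross-AZ share after same-AZ preference is enabled.
after = monthly_cross_az_cost(TRAFFIC_GB, 0.10, ASSUMED_PRICE_PER_GB)

print(f"before: ${before:,.2f}  after: ${after:,.2f}  saved: ${before - after:,.2f}")
```

Repeating the same estimate per service is one way to turn cross-AZ bandwidth into an explicit cost line item.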

Cross-AZ latency awareness is an operational discipline that pays off across years. Nova AI Ops integrates with cross-AZ telemetry, surfaces topology patterns, and supports the team’s network engineering discipline.