Connection Warmup

Warm pools; cold-start.

Overview

Connection warmup pre-establishes connections so the first real request does not pay the connection cost. Cold-start latency in modern stacks comes from TCP three-way handshake, TLS handshake, TCP slow-start congestion-window ramp, and HTTP/2 stream initialization. Each of these is a real round-trip cost that the user pays unless the connection is already warm. The discipline is to pre-warm on service startup, keep-alive during idle to preserve the warm state, and monitor tail latency to catch when warmup breaks.

The approach

The practical approach is to pre-warm connection pools on service startup (synthetic requests through the pool establish the connections), enable keep-alive during idle to prevent connection close (idle timeout longer than the typical inter-request gap), enable TLS session tickets across instances so horizontal scale preserves resumption, monitor p99 and p99.9 latency to catch warmup regressions, and document the per-service warmup sequence in the service repo.

Why this compounds

Connection warmup compounds across services. Each warmed connection produces consistent latency where cold connections would spike; the team builds intuition for connection-cost patterns that pays off on every new service. Without the discipline, tail latency stays unpredictable and the team chases spikes that warmup would have prevented.

Connection warmup is an operational discipline that pays off across years. Nova AI Ops integrates with latency telemetry, surfaces warmup patterns, and supports the team’s performance discipline.