Pool Size vs Throughput

Optimal pool size.

Overview

Pool size and throughput have a non-linear relationship: throughput rises with pool size up to a peak, then degrades as contention dominates. The peak is per-application: a CPU-bound workload peaks at smaller pool sizes than an IO-bound one, and a latency-sensitive workload usually wants pools sized below the throughput peak so queuing time stays bounded. The default value the framework ships with is almost never the right one for production workloads.

The approach

The practical approach is to benchmark per application against representative load, monitor pool utilization and per-request latency continuously, size below the throughput peak when latency matters more than throughput, retest quarterly because workloads drift, and document the per-pool size with its supporting curve in the infrastructure repo so the choice is reviewable.

Why this compounds

Pool sizing discipline compounds across services. Each correctly-sized pool produces ongoing throughput; each documented curve survives team turnover; new services inherit the benchmarking practice rather than re-deriving it. The opposite, where pool size is set once at deploy time and never revisited, accumulates inefficiency and surprise during traffic spikes.

Pool sizing discipline is an operational discipline that pays off across years. Nova AI Ops integrates with database telemetry, surfaces pool patterns, and supports the team’s capacity engineering discipline.