GLOSSARY · L

Latency

The time between a request starting and a response arriving, the user-perceived 'is it fast' metric.

Definition

Latency is the elapsed time between a request being sent and a response being received. It's measured at percentiles (p50, p95, p99) rather than averages because averages hide the tail: a service can have a 50ms average and a 2-second p99, and the p99 is what 1% of users feel. SRE teams set SLOs on latency at high percentiles because tail latency, not average latency, drives churn.

Why it matters

Latency is the most directly user-perceived signal in the golden-signals set. A slow service feels broken even when error rates are zero. Latency SLOs that include the percentile (e.g. '99% of /checkout responses under 500ms') are dramatically more useful than average-latency targets, because they constrain the tail that customers actually experience.

How Nova handles it

See the part of the platform that handles latency in production.

Nova golden signals

Latency

Definition

Why it matters

How Nova handles it

Related terms