Preemption Latency

How Kubernetes preemption affects user-facing latency.

Overview

Preemption latency is the user-facing latency added when Kubernetes evicts running pods to make room for higher-priority pods. Capacity planning targets average load; preemption shows up as tail-latency spikes that averages hide.
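To make the average-versus-tail point concrete, here is a minimal sketch with made-up numbers: 1,000 requests, of which 15 are assumed to stall for 2 seconds while a preempted pod is rescheduled. The sample values and the helper name nearest_rank_percentile are illustrative assumptions, not measurements.

```python
import math


def nearest_rank_percentile(samples, pct):
    """Return the pct-th percentile using the nearest-rank method."""
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]


# Hypothetical traffic: 985 normal requests at 50 ms, plus 15 requests
# that stalled for 2000 ms during a preemption event.
latencies_ms = [50.0] * 985 + [2000.0] * 15

mean_ms = sum(latencies_ms) / len(latencies_ms)
p50_ms = nearest_rank_percentile(latencies_ms, 50)
p99_ms = nearest_rank_percentile(latencies_ms, 99)

print(f"mean={mean_ms} ms, p50={p50_ms} ms, p99={p99_ms} ms")
```

With these assumed numbers the mean rises only from 50 ms to 79.25 ms and the median does not move, while the p99 jumps to 2000 ms. A dashboard that tracks averages would barely register the event.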

The approach

The practical approach has four parts: design a PriorityClass for each workload tier, set a PodDisruptionBudget on production deployments, monitor evictions routinely, and run on spot capacity only the workloads that tolerate interruption. Applied consistently, these keep performance predictable under preemption pressure.
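As a rough illustration of the first two items, the sketch below builds PriorityClass and PodDisruptionBudget manifests as plain Python dictionaries and prints them as JSON. The field names follow the scheduling.k8s.io/v1 and policy/v1 APIs; the object names (prod-critical, batch-low, checkout), the priority values, and the replica floor are hypothetical and would need to match your own workloads.

```python
import json


def priority_class(name, value, preemption_policy="PreemptLowerPriority"):
    """Build a PriorityClass manifest (scheduling.k8s.io/v1)."""
    return {
        "apiVersion": "scheduling.k8s.io/v1",
        "kind": "PriorityClass",
        "metadata": {"name": name},
        "value": value,
        "globalDefault": False,
        "preemptionPolicy": preemption_policy,
    }


def pod_disruption_budget(name, app_label, min_available):
    """Build a PodDisruptionBudget manifest (policy/v1)."""
    return {
        "apiVersion": "policy/v1",
        "kind": "PodDisruptionBudget",
        "metadata": {"name": name},
        "spec": {
            "minAvailable": min_available,
            "selector": {"matchLabels": {"app": app_label}},
        },
    }


# Hypothetical tiers: user-facing services may preempt lower-priority pods;
# batch work has low priority and never preempts anything itself.
manifests = [
    priority_class("prod-critical", 100000),
    priority_class("batch-low", 1000, preemption_policy="Never"),
    pod_disruption_budget("checkout-pdb", "checkout", min_available=2),
]

# Wrap in a v1 List so the output is a single JSON document.
bundle = {"apiVersion": "v1", "kind": "List", "items": manifests}
print(json.dumps(bundle, indent=2))
```

The printed JSON could be reviewed and then piped to kubectl apply -f -, since kubectl accepts JSON as well as YAML. One caveat: the scheduler honors PodDisruptionBudgets during preemption only on a best-effort basis, so a PDB lowers the chance that a protected pod is chosen as a victim but does not rule it out.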

Why this compounds

The benefits of managing preemption latency compound across services. Each protected workload preserves user experience, the team's Kubernetes expertise grows, and new services inherit the existing priority and PDB defaults.

Managing preemption latency is an operational practice that pays off over years. Nova AI Ops integrates with Kubernetes telemetry and surfaces patterns in it, supporting the team's Kubernetes practice.