The Percentile Trap in Aggregated Metrics

p99 of p99 is not p99. The math that breaks aggregation, the cases where it matters, and the workarounds.

The math

The percentile trap in aggregates is a common metric mistake. Computing percentiles per instance and then averaging them produces wrong results; the wrong results look correct and mislead decisions. Recognizing the trap and fixing the aggregation is the discipline.

What the math says:

The math is the foundation. Understanding why averaging percentiles is wrong is the prerequisite for fixing it.

Histograms solve it

The fix is to use histograms instead of pre-aggregated percentiles. Histograms aggregate correctly; the percentile is computed from the aggregated histogram.

Histograms are the right tool for percentile aggregation. The pattern is well-known; the discipline is using it.

When it matters

The trap matters when aggregate percentiles drive decisions. Single-instance percentiles are correct; the trap appears in aggregation across instances, regions, or time periods.

The percentile trap in aggregates is one of those metric mistakes that everyone makes once and learns from. Nova AI Ops integrates with metric stores, surfaces percentile aggregation patterns, and helps teams identify and fix the trap before it produces wrong decisions.