Three Eval Categories Every SRE Agent Needs

Capability evals. Safety evals. Cost evals. Why all three, what goes in each, and the failure modes of having only the first.

Category 1: capability evals

Capability evals are the category every team starts with. They are necessary but not sufficient on their own; a capable agent can still be unsafe or expensive.

Category 2: safety evals

Safety evals catch the failures that capability evals miss. A capable agent that overreaches passes capability and fails safety; both are required.

Category 3: cost evals

Cost evals are the early-warning system for cost drift. Without them, prompt growth and model changes silently blow the per-run budget.

How the three interact

The three categories trade against each other. Optimising one in isolation produces fragile agents; the diff vector across all three is what the reviewer reads.

Why three is the right number

Three is empirically the right granularity. The convergence across major platforms suggests the shape, not the names, is what matters.