SRE Agent Guardrails: A Defense-in-Depth Checklist

Eleven independent guardrails, each with a different failure model. The checklist, what each catches, and the order to add them as your agent matures.

The 11 layers

Defence in depth is the right shape because no single layer catches every failure. The 11 layers below are the canonical set most production agent platforms converge on.

Each layer covers different failures

The layers are independent on purpose. A failure that slips one layer is caught by the next; a failure that slips all 11 is exceedingly rare.

Order to add them

Adding all 11 at once stalls. The order below ships a safe-but-useless agent on day one and adds capability layer by layer with evidence.

When 11 is too many

Not every agent needs every layer. The two situations below are the only ones where the full set is overkill.

Testing the layers

Untested layers are decoration. The eval set must exercise each layer individually and the layers in combination.