Ground-Truth Verifier checks every remediation against reality at three time horizons: T+5 minutes, T+1 hour, and T+24 hours. The check compares the SLI value to the SLO target. If the symptom is gone and stays gone, the action is verified. If it comes back, trust drops and the action is flagged for review. Self-reported success no longer counts.
Different remediations regress at different timescales. A pgbouncer restart that fails comes back in seconds. A cache invalidation that was wrong shows up at peak traffic an hour later. A schema change that broke a downstream report only matters at the next batch run. Three windows let us catch all three.
A remediation does not pass just because nothing else broke. The agent declares the SLI it intends to fix and the threshold it expects. The verifier compares the live SLI value at each window to the threshold. Pass requires the SLI to be at or above the threshold. Anything else is a regression, not a "well, kind of."
The verifier is the ground truth that drives the Agent Ledger's trust scores. Each pass adds to trust, each regression subtracts. Multiple regressions in a row demote the agent's autonomy tier automatically. The trust system is not based on the agent's own assessment of its work; it is based on whether reality stayed fixed.
When you write a postmortem, the verifier history is your timeline. It shows which agents acted, which actions claimed to fix the problem, and at which window the fix held or broke. The page is filterable by incident, agent, and outcome so you can pull every verifier event for one incident in one click.
Subscribe to Nova AI Ops on YouTube for demos, tutorials, and feature deep-dives.
Every verified remediation pushes trust up. Every regression pushes it down. The agents that survive are the ones whose fixes hold.