The DNS Resolution Agent: Why It's a Good First Project

Bounded scope. Read-only signals. Clear success criteria. Why the DNS investigation agent is the project to ship before harder ones, plus the skeleton.

Why this scope is good

The scope is what makes DNS the right project zero. Bounded inputs: a hostname, a query type, an environment, three inputs and nothing else. Read-only signals: dig, nslookup, DNS cache contents, nothing the agent does changes the world. Clear success criterion: did the resolution succeed, what was the answer, if it failed why.

Tools the agent needs

The agent needs three tools. dig wrapper queries authoritative servers and returns structured output; cache lookup checks local resolver caches and catches stale-cache issues; DNS-DB query pulls records from the authoritative source for the zone and compares with what the resolver returned.

Output classes

The agent classifies into four buckets. Resolved correctly (most cases, agent confirms and exits); NXDOMAIN (hostname does not exist, sometimes a typo, sometimes intentional); stale cache (resolver returned an old answer, cache flush usually fixes); authoritative misconfiguration (zone has a problem, escalate to the team that owns the zone).

Why this is project zero

DNS is project zero because the failure modes are bounded and the runbooks already exist. DNS issues are common and well-understood and the team has prior runbooks (the agent is a translation, not a from-scratch design); failure modes are bounded (the worst the agent can do is return a wrong classification, humans verify before acting); success is satisfying because correct classification in 3 seconds produces team buy-in.

What to graduate to next

The graduation path is concrete. After DNS, try certificate-expiry investigation (similar bounded scope, similar read-only signals); after that, try external-service-status checks (the probe-classify-report pattern is common across many SRE problems); by the third agent, the team has internalised the agent-building pattern and subsequent agents come faster.