Datadog as a First-Class Tool for SRE Agents

Six Datadog API endpoints become six agent tools. The wrappers, the rate-limit handling, and the prompt patterns that get models to query Datadog effectively.

Six endpoints, six tools

The model never sees Datadog’s full API. It sees six narrow tools, each scoped to a specific question the agent needs to answer during triage. Six is the cap because the prompt budget for tool descriptions is finite and a thin tool surface keeps eval cost manageable.

Wrappers, not raw API

Every tool is a Python wrapper, not a raw HTTP call assembled from the prompt. The wrapper is where defaults, scope limits, and result shape are enforced, which is what makes the agent’s behaviour predictable across thousands of runs.

Rate limiting

Datadog rate limits per API key. An agent that retries on every error will exhaust the budget in minutes, so the wrapper layer absorbs back-pressure before it ever reaches the model.

Prompt patterns that work

Tool descriptions inside the system prompt earn their tokens by reducing wrong-tool calls. The patterns below cut tool-selection errors by more than half in our internal evals.

Eval cases

The eval set is what tells you whether a prompt or tool change shipped a regression. Run it on every change to the wrapper layer or the tool descriptions.