AI & ML Advanced By Samson Tanimawo, PhD Published Nov 17, 2026 5 min read

AI Failure Modes: A Taxonomy

Production AI systems fail in characteristic ways. A taxonomy of the failure modes makes them debuggable and, sometimes, preventable.

Eight failure modes

  1. Hallucination: confident wrong output.
  2. Prompt injection: attacker-controlled content overrides instructions.
  3. Distribution shift: production inputs diverge from training distribution.
  4. Cascading errors: one bad agent step corrupts subsequent steps.
  5. Model regression: provider updates the model, your app breaks.
  6. Cost runaway: bug or pattern causes 10-100x normal token usage.
  7. Privacy leakage: model regurgitates training data.
  8. Tool misuse: agent uses a tool incorrectly with real-world consequences.

Detection

Each has a different signature. Hallucination → output divergence from grounded sources. Distribution shift → embedding drift. Cost runaway → token-rate alerts. Building specific monitoring for each pays back the first time one fires.

Response patterns

Postmortem template: which failure mode, which detection signal, root cause, fix, and what changed in the eval set. Treat AI incidents like infrastructure incidents, with the same discipline.