Characterizing Faults in Agentic AI: A Taxonomy of Types, Symptoms, and Root Causes
Mehil B Shah, Mohammad Mehdi Morovati, Mohammad Masudur Rahman, Foutse Khomh

TL;DR
This paper empirically analyzes faults in agentic AI systems, creating a taxonomy of 34 fault types, their symptoms, and root causes, validated by practitioners, to improve diagnosis and reliability.
Contribution
It introduces the first comprehensive taxonomy of faults in agentic AI, derived from large-scale empirical data and validated through practitioner feedback.
Findings
Identified 34 fault types across four architectural dimensions.
Discovered recurring fault propagation patterns among components.
Validated taxonomy as representative of real-world agentic AI failures.
Abstract
Agentic AI systems combine LLM-based reasoning, orchestration, tool invocation, and interaction with external environments. These systems introduce faults that are difficult to characterize using existing taxonomies. To address this gap, we present an empirical study of faults in agentic AI systems. We collected 13,602 issues and pull requests from 40 repositories and, using stratified sampling, selected 385 faults for analysis. Through grounded theory, we derived taxonomies of fault types, symptoms, and root causes. We then used Apriori-based association rule mining to identify relationships among faults, symptoms, and root causes, and validated the taxonomy through a developer study with 145 practitioners. Our analysis produced a taxonomy of 34 fault types, organized into four architectural dimensions. These faults manifested as failures in structured-output interpretation, tool…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
