XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights
Arun Joshi

TL;DR
This paper introduces a systematic XAI framework that transforms raw execution traces of LLM-based coding agents into structured, human-interpretable explanations, significantly improving failure diagnosis and fix proposals for developers.
Contribution
It presents a novel domain-specific failure taxonomy, an automatic annotation system, and a hybrid explanation generator for better interpretability of agent failures.
Findings
Users identify failure root causes 2.8 times faster
Propose correct fixes with 73% higher accuracy
Outperforms ad-hoc explanation models in consistency and insight
Abstract
Large Language Model (LLM)-based coding agents show promise in automating software development tasks, yet they frequently fail in ways that are difficult for developers to understand and debug. While general-purpose LLMs like GPT can provide ad-hoc explanations of failures, raw execution traces remain challenging to interpret even for experienced developers. We present a systematic explainable AI (XAI) approach that transforms raw agent execution traces into structured, human-interpretable explanations. Our method consists of three key components: (1) a domain-specific failure taxonomy derived from analyzing real agent failures, (2) an automatic annotation system that classifies failures using defined annotation schema, (3) a hybrid explanation generator that produces visual execution flows, natural language explanations, and actionable recommendations. Through a user study with 20…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Software Engineering Research · Scientific Computing and Data Management
