The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution

Chen Qian; Peng Wang; Dongrui Liu; Junyao Yang; Dadi Guo; Ling Tang; Jilin Mei; Qihan Ren; Shuai Shao; Yong Liu; Jie Fu; Jing Shao; Xia Hu

arXiv:2601.15075·cs.AI·February 6, 2026

The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution

Chen Qian, Peng Wang, Dongrui Liu, Junyao Yang, Dadi Guo, Ling Tang, Jilin Mei, Qihan Ren, Shuai Shao, Yong Liu, Jie Fu, Jing Shao, Xia Hu

PDF

Open Access

TL;DR

This paper introduces a hierarchical framework for understanding the internal reasons behind agent actions in large language model-based systems, enhancing accountability and safety by identifying key internal factors regardless of success or failure.

Contribution

The paper presents a novel general agentic attribution framework that operates hierarchically to identify internal drivers of agent actions, applicable across diverse scenarios and independent of task outcomes.

Findings

01

Framework reliably identifies pivotal historical events influencing actions.

02

Effective in diverse scenarios including tool use and bias detection.

03

Enhances accountability in autonomous agent systems.

Abstract

Large Language Model (LLM)-based agents are widely used in real-world applications such as customer service, web navigation, and software engineering. As these systems become more autonomous and are deployed at scale, understanding why an agent takes a particular action becomes increasingly important for accountability and governance. However, existing research predominantly focuses on \textit{failure attribution} to localize explicit errors in unsuccessful trajectories, which is insufficient for explaining \textbf{the reason behind agent behaviors}. To bridge this gap, we propose a novel framework for \textbf{general agentic attribution}, designed to identify the internal factors driving agent actions regardless of the task outcome. Our framework operates hierarchically to manage the complexity of agent interactions. Specifically, at the \textit{component level}, we employ temporal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Ethics and Social Impacts of AI