Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study
Ira Ceka, Saurabh Pujar, Shyam Ramji, Luca Buratti, Gail Kaiser, and Baishakhi Ray

TL;DR
This empirical study analyzes the decision-making workflows of software engineering agents using execution traces, proposing a taxonomy and examining core components like bug localization, patch generation, and test creation to improve agent effectiveness.
Contribution
It introduces the first taxonomy of SWE agent decision pathways, studies core components in depth, and compares agent-generated and human patches through large-scale analysis.
Findings
Identified three core components: bug localization, patch generation, test creation.
Analyzed strategies for successful test generation and patch production.
Compared agent and developer patches, revealing structural and stylistic differences.
Abstract
With the advent of large language models (LLMs), software engineering agents (SWE agents) have emerged as a powerful paradigm for automating a range of software tasks -- from code generation and repair to test case synthesis. These agents operate autonomously by interpreting user input and responding to environmental feedback. While various agent architectures have demonstrated strong empirical performance, the internal decision-making worfklows that drive their behavior remain poorly understood. Deeper insight into these workflows hold promise for improving both agent reliability and efficiency. In this work, we present the first systematic study of SWE agent behavior through the lens of execution traces. Our contributions are as follows: (1) we propose the first taxonomy of decision-making pathways across five representative agents; (2) using this taxonomy, we identify three core…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Techniques and Practices · Software Engineering Research · Software Testing and Debugging Techniques
