TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis
Pepe Alonso, Sergio Yovine, Victor A. Braberman

TL;DR
TDAD is a tool that reduces code regressions in AI coding agents by performing impact analysis to identify relevant tests before code changes, improving reliability and issue resolution.
Contribution
Introduces TDAD, a static impact analysis tool that enhances AI coding agents by reducing regressions through targeted test verification before code commits.
Findings
TDAD reduced regressions by 70% compared to baseline.
Adding TDD instructions without impact analysis increased regressions.
TDAD improved issue-resolution rate from 24% to 32%.
Abstract
AI coding agents can resolve real-world software issues, yet they frequently introduce regressions -- breaking tests that previously passed. Current benchmarks focus almost exclusively on resolution rate, leaving regression behavior under-studied. This paper presents TDAD (Test-Driven Agentic Development), an open-source tool that performs pre-change impact analysis for AI coding agents. TDAD builds a dependency map between source code and tests so that before committing a patch, the agent knows which tests to verify and can self-correct. The map is delivered as a lightweight agent skill -- a static text file the agent queries at runtime. Evaluated on SWE-bench Verified with two open-weight models running on consumer hardware (Qwen3-Coder 30B, 100 instances; Qwen3.5-35B-A3B, 25 instances), TDAD reduced regressions by 70% (6.08% to 1.82%) compared to a vanilla baseline. In contrast,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Ethics and Social Impacts of AI
