TestDecision: Sequential Test Suite Generation via Greedy Optimization and Reinforcement Learning
Guoqing Wang, Chengran Yang, Xiaoxuan Zhou, Zeyu Sun, Bo Wang, David Lo, Dan Hao

TL;DR
TestDecision introduces a novel reinforcement learning-guided greedy approach for open-source LLM-based test suite generation, significantly improving coverage and bug detection over existing methods.
Contribution
It formalizes test suite generation as a submodular optimization problem and develops a reinforcement learning framework to enhance LLMs for sequential test generation.
Findings
Achieves 38-52% higher branch coverage than existing methods.
Finds 58-95% more bugs compared to vanilla LLMs.
Outperforms proprietary models like GPT-5.2 on benchmark tests.
Abstract
With the rapid evolution of LLMs, automated software testing is witnessing a paradigm shift. While proprietary models like GPT-4o demonstrate impressive capabilities, their high deployment costs and data privacy concerns make open-source LLMs the practical imperative for many academic and industrial scenarios. In the field of automated test generation, it has evolved to iterative workflows to construct test suites based on LLMs. When utilizing open-source LLMs, we empirically observe they lack a suite-level perspective, suffering from structural myopia-failing to generate new tests with large marginal gain based on the current covered status. In this paper, from the perspective of sequences, we formalize test suite generation as a MDP and demonstrate that its objective exhibits monotone submodularity, which enables an effective relaxation of this NP-hard global optimization into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
