CodeContests-O: Powering LLMs via Feedback-Driven Iterative Test Case Generation
Jianfeng Cai, Jinhua Zhu, Ruopei Sun, Kangwen Zhao, Dongyun Xue, Mingxiao Feng, Wengang Zhou, Houqiang Li

TL;DR
This paper introduces a feedback-driven iterative framework for generating high-quality, diverse test cases for programming problems, significantly improving verification accuracy and model fine-tuning performance.
Contribution
It proposes a novel feedback-driven method for test case generation that enhances diversity and discriminability, applied to the CodeContests dataset, and demonstrates improved verification and model fine-tuning results.
Findings
Achieved an average TPR of 89.37% and TNR of 90.89% on the entire solution pool.
Outperformed existing datasets by margins of 4.32% and 9.37%.
Fine-tuning models on the new dataset improved performance by 9.52% on LiveCodeBench.
Abstract
The rise of reasoning models necessitates large-scale verifiable data, for which programming tasks serve as an ideal source. However, while competitive programming platforms provide abundant problems and solutions, high-quality test cases for verification remain scarce. Existing approaches attempt to synthesize test cases using Large Language Models (LLMs), but rely solely on the model's intrinsic generation capabilities without external feedback, frequently resulting in insufficiently diverse cases. To address this limitation, we propose a for comprehensive test case construction. Specifically, our method leverages the LLM to generate initial test cases, executes them against known correct and incorrect solutions, and utilizes the failed results as feedback to guide the LLM in refining the test cases toward high fidelity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Testing and Debugging Techniques · Software Engineering Techniques and Practices
