An Iterative Test-and-Repair Framework for Competitive Code Generation
Lingxiao Tang, Muyang Ye, Zhaoyang Chu, Xiaoxue Ren, Zhongxin Liu, Lingfeng Bao, He Ye

TL;DR
This paper introduces FixAudit, an iterative test-and-repair framework for competitive code generation using LLMs, which outperforms existing methods by focusing on bug fixing and targeted testing.
Contribution
It proposes a novel iterative test-and-repair approach with a shared model for fixing and auditing code, improving performance over prior training-based methods like CURE.
Findings
FixAudit surpasses larger baseline models in zero-shot performance.
It improves Pass@1 by approximately 35% over strong baselines.
The framework achieves significant gains on multiple benchmarks.
Abstract
Large language models (LLMs) have made remarkable progress in code generation, but competitive programming remains a challenge. Recent training-based methods have improved code generation by using reinforcement learning (RL) with execution feedback. The more recent framework CURE further incorporates test generation into the training process, jointly training a Coder and a Tester within a single model. At inference time, the Coder generates many candidate programs, and the Tester generates tests from the problem description. The candidate who passes the most of the generated tests is selected as the final answer. However, CURE has two critical limitations. First, the Tester never reads any candidate code, so its tests often fail to expose implementation-specific bugs. Second, the Coder generates every candidate from scratch and never learns to fix a buggy program based on a failed test.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
