BACE: LLM-based Code Generation through Bayesian Anchored Co-Evolution of Code and Test Populations
Kaushitha Silva, Srinath Perera

TL;DR
BACE introduces a Bayesian co-evolutionary framework for code generation that leverages noisy test signals and minimal examples to improve the robustness and performance of LLM-based synthesis.
Contribution
It proposes a novel Bayesian co-evolution approach that models code and test populations as belief distributions, enhancing code generation robustness.
Findings
BACE outperforms existing methods on LiveCodeBench v6.
It improves code quality by effectively using noisy test signals.
BACE is effective across various language models, including small open-weight models.
Abstract
Large Language Models (LLMs) have demonstrated impressive capabilities in code generation. While an interactive feedback loop can improve performance, writing effective tests is a non-trivial task. Early multi-agent frameworks, such as AgentCoder, automated this process but relied on generated tests as absolute ground truth. This approach is fragile: incorrect code frequently passes faulty or trivial tests, while valid solutions are often degraded to satisfy incorrect assertions. Addressing this limitation, newer methods have largely abandoned test generation in favor of planning and reasoning based on examples. We argue, however, that generated tests remain a valuable signal if we model them as noisy sensors guided by bayesian updates. To this end, we introduce BACE (Bayesian Anchored Co-Evolution), a framework that reformulates synthesis as a Bayesian co-evolutionary process where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
