TL;DR
AdverMCTS introduces an adversarial Monte Carlo Tree Search framework that enhances code generation robustness by actively discovering vulnerabilities and reducing pseudo-correctness in LLM outputs.
Contribution
It presents a novel adversarial search method coupling code synthesis with vulnerability discovery to improve generalization beyond static test cases.
Findings
Outperforms state-of-the-art baselines in code generation tasks.
Reduces false positive rates by actively identifying logical flaws.
Encourages models to generalize beyond initial test constraints.
Abstract
Recent advancements in Large Language Models (LLMs) have successfully employed search-based strategies to enhance code generation. However, existing methods typically rely on static, sparse public test cases for verification, leading to pseudo-correctness -- where solutions overfit the visible public tests but fail to generalize to hidden test cases. We argue that optimizing against a fixed, weak environment inherently limits robustness. To address this, we propose AdverMCTS, a novel adversarial Monte Carlo Tree Search framework that combats pseudo-correctness by coupling code search with active vulnerability discovery. AdverMCTS formulates generation as a minimax-style game between a Solver agent, which synthesizes code candidates, and an Attacker agent, which evolves to generate targeted corner test cases that exploit logical divergences in the current code pool. These discovered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
