AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search

Qingyao Li; Weiwen Liu; Weinan Zhang; Yong Yu; Bo An

arXiv:2604.10449·cs.SE·April 14, 2026

AdverMCTS: Combating Pseudo-Correctness in Code Generation via Adversarial Monte Carlo Tree Search

Qingyao Li, Weiwen Liu, Weinan Zhang, Yong Yu, Bo An

PDF

1 Repo

TL;DR

AdverMCTS introduces an adversarial Monte Carlo Tree Search framework that enhances code generation robustness by actively discovering vulnerabilities and reducing pseudo-correctness in LLM outputs.

Contribution

It presents a novel adversarial search method coupling code synthesis with vulnerability discovery to improve generalization beyond static test cases.

Findings

01

Outperforms state-of-the-art baselines in code generation tasks.

02

Reduces false positive rates by actively identifying logical flaws.

03

Encourages models to generalize beyond initial test constraints.

Abstract

Recent advancements in Large Language Models (LLMs) have successfully employed search-based strategies to enhance code generation. However, existing methods typically rely on static, sparse public test cases for verification, leading to pseudo-correctness -- where solutions overfit the visible public tests but fail to generalize to hidden test cases. We argue that optimizing against a fixed, weak environment inherently limits robustness. To address this, we propose AdverMCTS, a novel adversarial Monte Carlo Tree Search framework that combats pseudo-correctness by coupling code search with active vulnerability discovery. AdverMCTS formulates generation as a minimax-style game between a Solver agent, which synthesizes code candidates, and an Attacker agent, which evolves to generate targeted corner test cases that exploit logical divergences in the current code pool. These discovered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://anonymous.4open.science/r/AdverMCTS_open-A255
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.