SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation
Bin Xu, Yiguan Lin, Yinghao Li, Yang Gao

TL;DR
SRA-MCTS enhances large language models' complex code generation by autonomously generating reasoning paths through Monte Carlo Tree Search, leading to improved accuracy and robustness without extra supervision.
Contribution
The paper introduces SRA-MCTS, a novel self-driven reasoning augmentation method that improves code generation performance by guiding models to generate high-quality reasoning paths autonomously.
Findings
Performance improvements across different model scales.
Robustness when traditional CoT degrades.
Enhanced diversity in solutions.
Abstract
Large language models demonstrate exceptional performance in simple code generation tasks but still face challenges in tackling complex problems. These challenges may stem from insufficient reasoning and problem decomposition capabilities. To address this issue, we propose a reasoning-augmented data generation process, SRA-MCTS, which guides the model to autonomously generate high-quality intermediate reasoning paths. This creates a positive feedback loop, enabling continuous improvement. Our method operates entirely through the model itself without requiring additional supervision. By synthesizing natural language reasoning paths and translating them into executable code, the approach ensures analytical accuracy and enhances the success rate in solving complex tasks. Experimental results show that, even without additional supervisory signals, our method achieves performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Natural Language Processing Techniques · Software Engineering Research
