Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation
Kounianhua Du, Hanjing Wang, Jianxing Liu, Jizheng Chen, Xinyi Dai,, Yasheng Wang, Ruiming Tang, Yong Yu, Jun Wang, Weinan Zhang

TL;DR
This paper introduces a robust System2-to-System1 pipeline for code generation using a novel BDC framework, combining boosting, data disentanglement, and customization to improve reasoning and adaptability of LLMs.
Contribution
The paper proposes a new BDC framework with MC-Tree-Of-Agents, DisenLora algorithm, and input-aware hypernetworks to enhance LLM reasoning, robustness, and customization in code generation tasks.
Findings
Effective exploration of System 2 knowledge via mutual boosting.
Disentangling heterogeneous data improves LLM training.
Customized problem solvers enhance code generation accuracy.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities in various domains, particularly in system 1 tasks, yet the intricacies of their problem-solving mechanisms in system 2 tasks are not sufficiently explored. Recent research on System2-to-System1 methods surge, exploring the System 2 reasoning knowledge via inference-time computation and compressing the explored knowledge into System 1 process. In this paper, we focus on code generation, which is a representative System 2 task, and identify two primary challenges: (1) the complex hidden reasoning processes and (2) the heterogeneous data distributions that complicate the exploration and training of robust LLM solvers. To tackle these issues, we propose a novel BDC framework that explores insightful System 2 knowledge of LLMs using a MC-Tree-Of-Agents algorithm with mutual \textbf{B}oosting, \textbf{D}isentangles the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEmbedded Systems Design Techniques · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
MethodsMonte-Carlo Tree Search · Focus · Pruning · HyperNetwork
