Blueprint2Code: a multi-agent pipeline for reliable code generation via blueprint planning and repair
Kehao Mao, Baokun Hu, Ruixin Lin, Zewen Li, Guanyu Lu, Zhengyu Zhang

TL;DR
Blueprint2Code is a multi-agent system that improves code generation by mimicking human programming workflows, leading to better performance on complex tasks.
Contribution
Introduces a novel multi-agent framework that enhances code generation through coordinated task planning, implementation, and debugging.
Findings
Blueprint2Code outperforms existing methods on benchmark datasets like HumanEval and MBPP.
It achieves high pass@1 scores, including 96.3% on HumanEval and 88.4% on MBPP.
The system shows robustness on extended and complex programming tasks.
Abstract
Automated programming has become a powerful tool for solving real-world problems. Code generation, in particular, plays a key role in improving developer productivity and reducing the entry barrier to software development. Recent advances in large language models (LLMs) have significantly improved program synthesis, enabling high-quality code generation from natural language. However, LLMs still struggle with complex tasks, especially in understanding problem intent, conducting multi-step reasoning, and producing code that passes all test cases. As task difficulty increases, existing models often fail to devise complete and reliable generation strategies, leading to reduced accuracy and robustness. To address these limitations, we propose Blueprint2Code, an innovative multi-agent framework for code generation. It emulates the human programming workflow through the coordinated…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Advanced Malware Detection Techniques · Model-Driven Software Engineering Techniques
