Lifecycle-Aware code generation: Leveraging Software Engineering Phases in LLMs
Xing Xing, Wei Wang, Lipeng Ma, Weidong Yang, and Junjie Zheng

TL;DR
This paper presents a lifecycle-aware framework for code generation using LLMs that incorporates software engineering phases, intermediate artifacts, and multi-step reasoning, significantly improving code correctness and performance.
Contribution
It introduces a structured, lifecycle-aware approach to LLM-based code generation that leverages intermediate artifacts and multi-step inference, enhancing accuracy and robustness.
Findings
Code correctness improved by up to 75% after fine-tuning.
Multi-step inference outperforms single-step generation.
Open-source LLMs match or outperform pretrained models after fine-tuning.
Abstract
Recent progress in large language models (LLMs) has advanced automatic code generation, yet most approaches rely on direct, single-step translation from problem descriptions to code, disregarding structured software engineering practices. We introduce a lifecycle-aware framework that systematically incorporates intermediate artifacts such as requirements analysis, state machine modeling, and pseudocode into both the training and inference stages. This design aligns code generation with standard software development phases and enables more structured reasoning. Experiments show that lifecycle-level fine-tuning improves code correctness by up to 75% over the same model before fine-tuning, with performance gains compounding across intermediate stages. Multi-step inference consistently surpasses single-step generation, demonstrating the effectiveness of intermediate scaffolding. Notably,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
