Improving LLM Code Generation via Requirement-Aware Curriculum Reinforcement Learning
Shouyu Yin, Zhao Tian, Junjie Chen, Shikai Guo

TL;DR
This paper introduces RECRL, a requirement-aware curriculum reinforcement learning framework that enhances large language model code generation by better perceiving, optimizing, and sampling programming requirements.
Contribution
RECRL automatically perceives requirement difficulty, optimizes challenging requirements, and employs adaptive sampling to improve LLM code generation performance.
Findings
RECRL improves Pass@1 by 1.23%-5.62% over baselines.
RECRL effectively perceives requirement difficulty and optimizes training data.
Experiments on five LLMs and benchmarks validate RECRL's effectiveness.
Abstract
Code generation, which aims to automatically generate source code from given programming requirements, has the potential to substantially improve software development efficiency. With the rapid advancement of large language models (LLMs), LLM-based code generation has attracted widespread attention from both academia and industry. However, as programming requirements become increasingly complex, existing LLMs still exhibit notable performance limitations. To address this challenge, recent studies have proposed training-based curriculum reinforcement learning (CRL) strategies to improve LLM code generation performance. Despite their effectiveness, existing CRL approaches suffer from several limitations, including misaligned requirement difficulty perception, the absence of requirement difficulty optimization, and suboptimal curriculum sampling strategies. In CRL-based code generation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
