Training Language Models to Generate Quality Code with Program Analysis Feedback

Feng Yao; Zilong Wang; Liyuan Liu; Junxia Cui; Li Zhong; Xiaohan Fu; Haohui Mai; Vish Krishnan; Jianfeng Gao; Jingbo Shang

arXiv:2505.22704·cs.CL·May 30, 2025

TL;DR

This paper introduces REAL, a reinforcement learning framework that uses program analysis and unit tests to guide large language models in generating secure, maintainable, and functionally correct code without manual annotations.

Contribution

We propose a prompt-agnostic, reference-free reinforcement learning approach that improves code quality and security in LLM-generated code through automated feedback signals.

Findings

1. REAL outperforms existing methods in code quality and functionality.
2. The framework is scalable and does not require manual annotations.
3. Experiments show improved security and maintainability in generated code.

Abstract

Code generation with large language models (LLMs), often termed vibe coding, is increasingly adopted in production but fails to ensure code quality, particularly in security (e.g., SQL injection vulnerabilities) and maintainability (e.g., missing type annotations). Existing methods, such as supervised fine-tuning and rule-based post-processing, rely on labor-intensive annotations or brittle heuristics, limiting their scalability and effectiveness. We propose REAL, a reinforcement learning framework that incentivizes LLMs to generate production-quality code using program analysis-guided feedback. Specifically, REAL integrates two automated signals: (1) program analysis detecting security or maintainability defects and (2) unit tests ensuring functional correctness. Unlike prior work, our framework is prompt-agnostic and reference-free, enabling scalable supervision without manual…
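The two automated signals described in the abstract can be pictured as a single scalar reward for a generated program. The sketch below is illustrative only and is not the authors' implementation: the annotation check stands in for a real program analysis pass, and the function names, the binary quality score, and the weight `alpha` are all assumptions introduced here.

```python
import ast
from typing import Callable


def count_missing_annotations(source: str) -> int:
    """Toy stand-in for program analysis: count unannotated parameters and returns."""
    tree = ast.parse(source)
    missing = 0
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            params = node.args.posonlyargs + node.args.args + node.args.kwonlyargs
            missing += sum(1 for p in params if p.annotation is None)
            if node.returns is None:
                missing += 1
    return missing


def unit_test_pass_rate(source: str, tests: list[Callable[[dict], bool]]) -> float:
    """Execute the candidate code and return the fraction of tests that pass.

    NOTE: exec on model output is unsafe; a real pipeline would sandbox this step.
    """
    namespace: dict = {}
    try:
        exec(source, namespace)
    except Exception:
        return 0.0
    passed = 0
    for test in tests:
        try:
            if test(namespace):
                passed += 1
        except Exception:
            pass  # a crashing test counts as a failure
    return passed / len(tests) if tests else 0.0


def hybrid_reward(source: str, tests: list[Callable[[dict], bool]], alpha: float = 0.5) -> float:
    """Blend a quality signal and a functionality signal (alpha is an assumed weight)."""
    try:
        defects = count_missing_annotations(source)
    except SyntaxError:
        return 0.0  # unparsable code gets no reward
    quality = 1.0 if defects == 0 else 0.0
    functionality = unit_test_pass_rate(source, tests)
    return alpha * quality + (1 - alpha) * functionality


annotated = "def add(a: int, b: int) -> int:\n    return a + b\n"
unannotated = "def add(a, b):\n    return a + b\n"
tests = [lambda ns: ns["add"](1, 2) == 3]

print(hybrid_reward(annotated, tests))    # correct and fully annotated
print(hybrid_reward(unannotated, tests))  # correct but missing annotations
```

Because both signals are computed from the generated code alone, a reward of this shape needs no reference solution and no hand-written labels, which is the property the abstract calls reference-free.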

Code & Models

Repositories

yaof20/real (PyTorch, official)

Taxonomy

Topics: Software Engineering Research · Advanced Malware Detection Techniques · Security and Verification in Computing