RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation

Changzhi Zhou; Xinyu Zhang; Dandan Song; Xiancai Chen; Wanli Gu; Huipeng Ma; Yuhang Tian; Mengdi Zhang; Linmei Hu

arXiv:2502.09183·cs.CL·August 22, 2025

RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation

Changzhi Zhou, Xinyu Zhang, Dandan Song, Xiancai Chen, Wanli Gu, Huipeng Ma, Yuhang Tian, Mengdi Zhang, Linmei Hu

PDF

Open Access

TL;DR

RefineCoder introduces an iterative self-improvement method for large language models in code generation, utilizing adaptive critique and refinement to enhance performance without relying solely on teacher model imitation.

Contribution

The paper proposes Adaptive Critique Refinement (ACR), a novel iterative self-refinement approach for LLMs that improves code quality through self-generated critique, surpassing traditional fine-tuning methods.

Findings

01

Achieves continuous performance improvement on multiple benchmarks.

02

Outperforms baselines of the same size with less data.

03

Demonstrates the effectiveness of self-critique in code generation.

Abstract

Code generation has attracted increasing attention with the rise of Large Language Models (LLMs). Many studies have developed powerful code LLMs by synthesizing code-related instruction data and applying supervised fine-tuning. However, these methods are limited by teacher model distillation and ignore the potential of iterative refinement by self-generated code. In this paper, we propose Adaptive Critique Refinement (ACR), which enables the model to refine itself by self-generated code and external critique, rather than directly imitating the code responses of the teacher model. Concretely, ACR includes a composite scoring system with LLM-as-a-Judge to evaluate the quality of code responses and a selective critique strategy with LLM-as-a-Critic to critique self-generated low-quality code responses. We develop the RefineCoder series by iteratively applying ACR, achieving continuous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsSoftmax · Attention Is All You Need