Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs

Zhihong Sun; Chen Lyu; Bolun Li; Yao Wan; Hongyu Zhang; Ge Li; Zhi Jin

arXiv:2403.13271·cs.SE·March 21, 2024·5 cites

TL;DR

This paper introduces CodePLAN, a distillation framework that transfers the reasoning abilities of large language models (LLMs) to smaller models, improving their pass@1 score on the APPS benchmark by over 130%.

Contribution

The paper presents a novel multi-task distillation method using backward reasoning and plan sampling to improve small models' code generation by mimicking LLM reasoning.
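As a rough illustration of what a multi-task distillation objective can look like, the sketch below combines a code-generation loss with a plan-generation loss. It is not taken from the paper: the weighted-sum form, the function names, and the `plan_weight` parameter are all assumptions made for illustration, and the inputs are simply the probabilities a hypothetical small model assigns to each reference token.

```python
import math


def cross_entropy(token_probs):
    """Mean negative log-likelihood of the reference tokens.

    `token_probs` holds the probability the model assigned to each
    reference token (hypothetical inputs for this sketch).
    """
    return -sum(math.log(p) for p in token_probs) / len(token_probs)


def multi_task_loss(code_token_probs, plan_token_probs, plan_weight=0.5):
    """Assumed objective: code loss plus a weighted plan loss.

    The plan targets would come from an LLM teacher; the weighting
    scheme here is an illustrative assumption, not the paper's formula.
    """
    code_loss = cross_entropy(code_token_probs)
    plan_loss = cross_entropy(plan_token_probs)
    return code_loss + plan_weight * plan_loss


if __name__ == "__main__":
    # Toy probabilities for a three-token solution and a two-token plan.
    print(multi_task_loss([0.9, 0.8, 0.7], [0.6, 0.5]))
```

Under this assumed form, setting `plan_weight` to 0 reduces the objective to ordinary code-only fine-tuning, which makes the contribution of the plan task easy to isolate in an ablation.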

Findings

01. Over 130% improvement in the pass@1 metric on the APPS benchmark

02. Effective transfer of reasoning abilities from LLMs to smaller models

03. Enhanced code generation performance with multi-task distillation
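To make the headline number easier to interpret, the sketch below shows the widely used unbiased pass@k estimator and how a relative improvement is computed. This page does not state how the paper computes pass@1, so the estimator is an assumption, and the scores in the usage example are hypothetical values chosen only to show the arithmetic.

```python
from math import comb


def pass_at_k(n, c, k):
    """Unbiased pass@k estimate for one problem.

    n: samples generated, c: samples passing all tests, k: budget.
    Returns the probability that at least one of k samples drawn
    without replacement from the n is correct.
    """
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def relative_improvement(baseline, improved):
    """Relative gain over the baseline, in percent."""
    return (improved - baseline) / baseline * 100.0


if __name__ == "__main__":
    # Hypothetical scores, not results from the paper.
    print(pass_at_k(n=10, c=3, k=1))
    print(relative_improvement(baseline=2.0, improved=4.6))
```

Note that an improvement of over 130% is relative: the distilled model's pass@1 is more than 2.3 times the baseline's, which says nothing on its own about the absolute score.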

Abstract

Large Language Models (LLMs) have recently made significant advances in code generation through the 'Chain-of-Thought' prompting technique. This technique empowers the model to autonomously devise "solution plans" to tackle intricate programming challenges, thereby improving its performance in code generation. Nevertheless, smaller models have been struggling to keep up with LLMs in deducing these plans, adversely affecting their code generation capabilities. Given the considerable size and associated deployment costs, along with concerns about data security, many teams opt for deploying smaller models for code generation. Consequently, there arises a compelling need for transferring LLMs' code generation reasoning abilities to the smaller models. In this paper, we propose the CodePLAN framework, which aims to transfer LLMs' reasoning capabilities to smaller models through distillation.…

Code & Models

Repositories

sssszh/codeplan
jax · Official

Taxonomy

Topics: Natural Language Processing Techniques · Model-Driven Software Engineering Techniques · Software Engineering Research