CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules

Hung Le; Hailin Chen; Amrita Saha; Akash Gokul; Doyen Sahoo; Shafiq Joty

arXiv:2310.08992 · cs.AI · March 15, 2024 · 2 cites

TL;DR

CodeChain is an inference framework that elicits modular code from large language models through a chain of self-revisions, each guided by representative sub-modules from earlier iterations, improving correctness and reusability on complex programming tasks.

Contribution

The paper presents a self-revision-based approach to modular code generation that improves the reusability and correctness of LLM-generated solutions.

Findings

1. Achieves a 35% relative improvement in pass@1 on APPS
2. Achieves a 76% relative improvement in pass@1 on CodeContests
3. Effective with both OpenAI and open-source LLMs
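
To make "relative improvement on pass@1" concrete, here is a minimal sketch of the arithmetic. The helper names `pass_at_1` and `relative_improvement` are illustrative, and the baseline and improved scores are hypothetical placeholders chosen only to show the calculation; they are not results from the paper.

```python
def pass_at_1(num_correct: int, num_samples: int) -> float:
    """pass@1 for one problem: the fraction of sampled programs that pass all tests."""
    if num_samples <= 0:
        raise ValueError("num_samples must be positive")
    return num_correct / num_samples


def relative_improvement(baseline: float, improved: float) -> float:
    """Relative gain of `improved` over `baseline`, as a fraction of the baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (improved - baseline) / baseline


# Hypothetical scores (NOT taken from the paper), for illustration only.
baseline_score = 0.20
improved_score = 0.27
gain = relative_improvement(baseline_score, improved_score)
print(f"relative improvement: {gain:.1%}")
```

A relative improvement is measured against the baseline score, so the same percentage corresponds to a smaller absolute gain when the baseline is low. A benchmark-level pass@1 is typically the mean of the per-problem values.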

Abstract

Large Language Models (LLMs) have already become quite proficient at solving simpler programming tasks like those in HumanEval or MBPP benchmarks. However, solving more complex and competitive programming tasks is still quite challenging for these models - possibly due to their tendency to generate solutions as monolithic code blocks instead of decomposing them into logical sub-tasks and sub-modules. On the other hand, experienced programmers instinctively write modularized code with abstraction for solving complex tasks, often reusing previously developed modules. To address this gap, we propose CodeChain, a novel framework for inference that elicits modularized code generation through a chain of self-revisions, each being guided by some representative sub-modules generated in previous iterations. Concretely, CodeChain first instructs the LLM to generate modularized codes through…
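
The loop the abstract describes (generate modular code, then revise repeatedly with representative sub-modules from earlier rounds) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The names `self_revise`, `pick_representatives`, and the `generate` callback are hypothetical, and the prompt wording is invented. Choosing representatives by most frequent function name is a placeholder, because the abstract excerpt above does not say how they are selected.

```python
import ast
from collections import Counter
from typing import Callable, Dict, List


def pick_representatives(programs: List[str], top_k: int = 2) -> List[str]:
    """Placeholder selection rule (an assumption): keep the first-seen body of
    each of the top_k most frequently occurring top-level function names."""
    counts: Counter = Counter()
    first_body: Dict[str, str] = {}
    for program in programs:
        for node in ast.parse(program).body:
            if isinstance(node, ast.FunctionDef):
                counts[node.name] += 1
                body = ast.get_source_segment(program, node)
                first_body.setdefault(node.name, body)
    return [first_body[name] for name, _ in counts.most_common(top_k)]


def self_revise(
    task: str,
    generate: Callable[[str], List[str]],  # hypothetical stand-in for LLM sampling
    rounds: int = 2,
) -> List[str]:
    """Sample programs, then re-prompt with selected sub-modules for `rounds` rounds."""
    programs = generate(task)
    for _ in range(rounds):
        modules = pick_representatives(programs)
        # Invented prompt format: append the chosen sub-modules to the task text.
        prompt = task + "\n\n# Reuse or adapt these sub-modules:\n" + "\n\n".join(modules)
        programs = generate(prompt)
    return programs
```

Any function that maps a prompt string to a list of candidate programs can be passed as `generate`, so the loop can be exercised with a fixed stub before wiring in a real model.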

Code & Models

Repositories

SalesforceAIResearch/CodeChain · PyTorch · Official

Taxonomy

Topics: Software Engineering Research · Software System Performance and Reliability · Topic Modeling