InverseCoder: Self-improving Instruction-Tuned Code LLMs with   Inverse-Instruct

Yutong Wu; Di Huang; Wenxuan Shi; Wei Wang; Lingzhe Gao; Shihao Liu,; Ziyuan Nan; Kaizhao Yuan; Rui Zhang; Xishan Zhang; Zidong Du; Qi Guo; Yewen; Pu; Dawei Yin; Xing Hu; Yunji Chen

arXiv:2407.05700·cs.CL·December 17, 2024

InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct

Yutong Wu, Di Huang, Wenxuan Shi, Wei Wang, Lingzhe Gao, Shihao Liu,, Ziyuan Nan, Kaizhao Yuan, Rui Zhang, Xishan Zhang, Zidong Du, Qi Guo, Yewen, Pu, Dawei Yin, Xing Hu, Yunji Chen

PDF

Open Access 1 Repo 4 Models 3 Datasets

TL;DR

This paper introduces Inverse-Instruct, a data augmentation method that uses a fine-tuned code LLM to generate additional instruction-response pairs, improving model performance without relying on expensive external data.

Contribution

It presents a novel self-improving technique for instruction tuning of open-source code LLMs by leveraging their own generated data, enhancing performance efficiently.

Findings

01

Consistent performance improvements across multiple models and benchmarks.

02

Effective augmentation of training data using inverse instruction generation.

03

Validation on diverse open-source models and datasets.

Abstract

Recent advancements in open-source code large language models (LLMs) have been driven by fine-tuning on the data generated from powerful closed-source LLMs, which are expensive to obtain. This paper explores whether it is possible to use a fine-tuned open-source model to generate additional data to augment its instruction-tuning dataset. We make two observations: (1) A code snippet can serve as the response to different instructions. (2) Instruction-tuned code LLMs perform better at translating code into instructions than the reverse. Based on these observations, we propose Inverse-Instruct, a data augmentation technique that uses a fine-tuned LLM to generate additional instructions of code responses from its own training dataset. The additional instruction-response pairs are added to the original dataset, and a stronger code LLM can be obtained by fine-tuning on the augmented dataset.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wyt2000/InverseCoder
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVLSI and Analog Circuit Testing · Advancements in Photolithography Techniques · Advancements in Semiconductor Devices and Circuit Design

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Linear Layer · Label Smoothing · Adam · Dropout · Dense Connections · Weight Decay