Instruction Tuning with GPT-4

Baolin Peng; Chunyuan Li; Pengcheng He; Michel Galley and; Jianfeng Gao

arXiv:2304.03277·cs.CL·April 7, 2023·187 cites

Instruction Tuning with GPT-4

Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley and, Jianfeng Gao

PDF

Open Access 2 Repos 10 Models 5 Datasets

TL;DR

This paper explores using GPT-4 to generate instruction-following data for fine-tuning large language models, demonstrating improved zero-shot performance and providing a new approach to data creation without human input.

Contribution

It introduces GPT-4 generated instruction data for LLM fine-tuning, showing superior performance over previous data sources and providing resources for further research.

Findings

01

GPT-4 generated data improves zero-shot task performance

02

The dataset includes 52K instructions in English and Chinese

03

Public release of data and code for community use

Abstract

Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed. In this paper, we present the first attempt to use GPT-4 to generate instruction-following data for LLM finetuning. Our early experiments on instruction-tuned LLaMA models show that the 52K English and Chinese instruction-following data generated by GPT-4 leads to superior zero-shot performance on new tasks to the instruction-following data generated by previous state-of-the-art models. We also collect feedback and comparison data from GPT-4 to enable a comprehensive evaluation and reward model training. We make our data generated using GPT-4 as well as our codebase publicly available.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections