Instruction Tuning with GPT-4
Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley and, Jianfeng Gao

TL;DR
This paper explores using GPT-4 to generate instruction-following data for fine-tuning large language models, demonstrating improved zero-shot performance and providing a new approach to data creation without human input.
Contribution
It introduces GPT-4 generated instruction data for LLM fine-tuning, showing superior performance over previous data sources and providing resources for further research.
Findings
GPT-4 generated data improves zero-shot task performance
The dataset includes 52K instructions in English and Chinese
Public release of data and code for community use
Abstract
Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed. In this paper, we present the first attempt to use GPT-4 to generate instruction-following data for LLM finetuning. Our early experiments on instruction-tuned LLaMA models show that the 52K English and Chinese instruction-following data generated by GPT-4 leads to superior zero-shot performance on new tasks to the instruction-following data generated by previous state-of-the-art models. We also collect feedback and comparison data from GPT-4 to enable a comprehensive evaluation and reward model training. We make our data generated using GPT-4 as well as our codebase publicly available.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗allenai/tulu-7bmodel· 43 dl· ♡ 943 dl♡ 9
- 🤗allenai/open-instruct-gpt4-alpaca-7bmodel· 25 dl· ♡ 125 dl♡ 1
- 🤗allenai/open-instruct-gpt4-alpaca-13bmodel· 14 dl· ♡ 114 dl♡ 1
- 🤗allenai/tulu-30bmodel· 41 dl· ♡ 1841 dl♡ 18
- 🤗allenai/tulu-65bmodel· 37 dl· ♡ 2137 dl♡ 21
- 🤗allenai/tulu-13bmodel· 25 dl· ♡ 825 dl♡ 8
- 🤗TheBloke/tulu-30B-GPTQmodel· 15 dl· ♡ 1015 dl♡ 10
- 🤗TheBloke/tulu-30B-fp16model· 829 dl· ♡ 5829 dl♡ 5
- 🤗TheBloke/tulu-30B-GGMLmodel· ♡ 9♡ 9
- 🤗TheBloke/tulu-13B-GGMLmodel· ♡ 7♡ 7
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections
