Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation
Da Yin, Xiao Liu, Fan Yin, Ming Zhong, Hritik Bansal, Jiawei Han,, Kai-Wei Chang

TL;DR
Dynosaur introduces a cost-effective, automated method for expanding instruction-tuning datasets by leveraging existing datasets and LLMs, improving data quality and supporting continual model enhancement.
Contribution
It presents a novel dynamic growth paradigm that automatically constructs instruction-tuning data from existing datasets, reducing costs and enabling continual learning.
Findings
Outperforms Alpaca and Flan on Super-NI and Longform datasets.
Cost less than $12 USD for 800K samples using GPT-3.5-turbo.
Replaying diverse tasks mitigates forgetting and improves generalization.
Abstract
Instruction tuning has emerged to enhance the capabilities of large language models (LLMs) to comprehend instructions and generate appropriate responses. Existing methods either manually annotate or employ LLM (e.g., GPT-series) to generate data for instruction tuning. However, they often overlook associating instructions with existing annotated datasets. In this paper, we propose Dynosaur, a dynamic growth paradigm for the automatic curation of instruction-tuning data. Based on the metadata of existing datasets, we use LLMs to automatically construct instruction-tuning data by identifying relevant data fields and generating appropriate instructions. By leveraging the existing annotated datasets, Dynosaur offers several advantages: 1) it reduces the API cost for generating instructions (e.g., it costs less than $12 USD by calling GPT-3.5-turbo for generating 800K instruction tuning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Cosine Annealing · Weight Decay · Linear Warmup With Cosine Annealing · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · GPT-3 · Absolute Position Encodings
