IterSelectTune: An Iterative Training Framework for Efficient   Instruction-Tuning Data Selection

Jielin Song; Siyu Liu; Bin Zhu; Yanghui Rao

arXiv:2410.13464·cs.CL·October 18, 2024

IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

Jielin Song, Siyu Liu, Bin Zhu, Yanghui Rao

PDF

Open Access

TL;DR

IterSelectTune is an innovative iterative training framework that efficiently selects high-quality instruction data for large language models, reducing human effort and computational costs while improving performance.

Contribution

The paper introduces IterSelectTune, a novel data selection method that outperforms full dataset fine-tuning with only 20% of data and minimal human involvement.

Findings

01

Outperforms models trained on full datasets across multiple benchmarks.

02

Reduces instruction tuning data requirements by 80%.

03

Requires minimal reliance on GPT-4 during data selection.

Abstract

As large language models (LLMs) continue to advance, instruction tuning has become critical for improving their ability to generate accurate and contextually appropriate responses. Although numerous instruction-tuning datasets have been developed to enhance LLM performance, selecting high-quality instruction data from large source datasets typically demands significant human effort. In this work, we introduce $IterSelectTune$ , an efficient, cost-effective iterative training policy for selecting high-quality instruction data with no human involvement and limited reliance on GPT-4. By fine-tuning on approximately 20\% of the source data, our method consistently outperforms models fine-tuned on the full dataset across multiple benchmarks and public test datasets. These results highlight the effectiveness of our approach in enhancing LLM performance while reducing the computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning · Educational Technology and Assessment

MethodsAdam · Attention Is All You Need · Dropout · Dense Connections · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings