SeDi-Instruct: Enhancing Alignment of Language Models through   Self-Directed Instruction Generation

Jungwoo Kim; Minsang Kim; Sungjin Lee

arXiv:2502.04774·cs.CL·February 10, 2025

SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation

Jungwoo Kim, Minsang Kim, Sungjin Lee

PDF

Open Access

TL;DR

SeDi-Instruct introduces a cost-effective, high-quality instruction data generation framework that improves language model alignment by employing diversity filtering and iterative feedback, leading to better accuracy and reduced costs.

Contribution

It presents a novel data generation framework combining diversity filtering and iterative feedback to produce high-quality instruction data efficiently.

Findings

01

Models trained with SeDi-Instruct data improve accuracy by 5.2%.

02

Instruction data generation costs are reduced by 36%.

03

Enhanced instruction diversity maintains model performance without high costs.

Abstract

The rapid evolution of Large Language Models (LLMs) has enabled the industry to develop various AI-based services. Instruction tuning is considered essential in adapting foundation models for target domains to provide high-quality services to customers. A key challenge in instruction tuning is obtaining high-quality instruction data. Self-Instruct, which automatically generates instruction data using ChatGPT APIs, alleviates the data scarcity problem. To improve the quality of instruction data, Self-Instruct discards many of the instructions generated from ChatGPT, even though it is inefficient in terms of cost owing to many useless API calls. To generate high-quality instruction data at a low cost, we propose a novel data generation framework, Self-Direct Instruction generation (SeDi-Instruct), which employs diversity-based filtering and iterative feedback task generation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling