DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large   Language Models

Yuanhao Zeng; Fei Ren; Xinpeng Zhou; Yihang Wang; Yingxia Shao

arXiv:2408.10841·cs.AI·August 21, 2024

DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models

Yuanhao Zeng, Fei Ren, Xinpeng Zhou, Yihang Wang, Yingxia Shao

PDF

Open Access

TL;DR

This paper introduces DELIA, a novel data synthesis method that enhances instruction tuning in large language models by transforming biased features into more ideal, semantic-rich features through diverse data, leading to improved performance.

Contribution

DELIA is a new data synthesis approach that leverages diverse data to improve instruction adaptation in LLMs without prior knowledge of ideal features.

Findings

01

DELIA outperforms standard instruction tuning by 17-33% in translation quality.

02

DELIA improves accuracy by 36.1% in formatted text generation.

03

DELIA uniquely aligns new token representations with their prior semantics.

Abstract

Although instruction tuning is widely used to adjust behavior in Large Language Models (LLMs), extensive empirical evidence and research indicates that it is primarily a process where the model fits to specific task formats, rather than acquiring new knowledge or capabilities. We propose that this limitation stems from biased features learned during instruction tuning, which differ from ideal task-specfic features, leading to learn less underlying semantics in downstream tasks. However, ideal features are unknown and incalculable, constraining past work to rely on prior knowledge to assist reasoning or training, which limits LLMs' capabilities to the developers' abilities, rather than data-driven scalable learning. In our paper, through our novel data synthesis method, DELIA (Diversity-Enhanced Learning for Instruction Adaptation), we leverage the buffering effect of extensive diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics

MethodsALIGN