DELIA: Diversity-Enhanced Learning for Instruction Adaptation in Large Language Models
Yuanhao Zeng, Fei Ren, Xinpeng Zhou, Yihang Wang, Yingxia Shao

TL;DR
This paper introduces DELIA, a novel data synthesis method that enhances instruction tuning in large language models by transforming biased features into more ideal, semantic-rich features through diverse data, leading to improved performance.
Contribution
DELIA is a new data synthesis approach that leverages diverse data to improve instruction adaptation in LLMs without prior knowledge of ideal features.
Findings
DELIA outperforms standard instruction tuning by 17-33% in translation quality.
DELIA improves accuracy by 36.1% in formatted text generation.
DELIA uniquely aligns new token representations with their prior semantics.
Abstract
Although instruction tuning is widely used to adjust behavior in Large Language Models (LLMs), extensive empirical evidence and research indicates that it is primarily a process where the model fits to specific task formats, rather than acquiring new knowledge or capabilities. We propose that this limitation stems from biased features learned during instruction tuning, which differ from ideal task-specfic features, leading to learn less underlying semantics in downstream tasks. However, ideal features are unknown and incalculable, constraining past work to rely on prior knowledge to assist reasoning or training, which limits LLMs' capabilities to the developers' abilities, rather than data-driven scalable learning. In our paper, through our novel data synthesis method, DELIA (Diversity-Enhanced Learning for Instruction Adaptation), we leverage the buffering effect of extensive diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
MethodsALIGN
