Beyond Similarity: A Gradient-based Graph Method for Instruction Tuning Data Selection
Yang Zhao, Li Du, Xiao Ding, Yangou Ouyang, Hepeng Wang, Kai Xiong,, Jinglong Gao, Zhouhao Sun, Dongliang Xu, Yang Qing, Dongchen Li, Bing Qin and, Ting Liu

TL;DR
G2IS is a novel gradient-based graph method for instruction data selection that captures instruction interdependencies, improving domain adaptation efficiency and performance in instruction tuning of large language models.
Contribution
Introduces G2IS, a gradient-based graph approach that models instruction relationships for better data selection in domain-specific instruction tuning.
Findings
G2IS outperforms traditional data selection methods.
Significant performance gains in data-scarce domain adaptation tasks.
Enhanced training efficiency and effectiveness with G2IS.
Abstract
Large language models (LLMs) have shown great potential across various industries due to their remarkable ability to generalize through instruction tuning. However, the limited availability of domain-specific data significantly hampers their performance on specialized tasks. While existing methods primarily focus on selecting training data from general datasets that are similar to the target domain, they often fail to consider the joint distribution of instructions, resulting in inefficient learning and suboptimal knowledge transfer. To address these challenges, we introduce G2IS (Gradient-based Graph Instruction Selection), a novel method that constructs a mixed gradient-based instruction graph to capture the joint distribution and interdependencies between instructions. By accounting for the relationships between instructions, G2IS improves domain adaptation efficiency. Additionally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Educational Technology and Assessment · Intelligent Tutoring Systems and Adaptive Learning
MethodsFocus
