GradientSpace: Unsupervised Data Clustering for Improved Instruction Tuning
Shrihari Sridharan, Deepak Ravikumar, Anand Raghunathan, Kaushik Roy

TL;DR
GradientSpace introduces a novel gradient clustering framework that enhances instruction tuning by creating specialized experts, leading to improved accuracy and reduced inference latency across diverse tasks.
Contribution
It proposes an online SVD-based gradient clustering method for full-dimensional gradient space, enabling effective expert specialization without high inference costs.
Findings
Outperforms prior clustering methods in accuracy.
Reduces inference latency compared to ensemble approaches.
Achieves consistent gains across multiple downstream tasks.
Abstract
Instruction tuning is one of the key steps required for adapting large language models (LLMs) to a broad spectrum of downstream applications. However, this procedure is difficult because real-world datasets are rarely homogeneous; they consist of a mixture of diverse information, causing gradient interference, where conflicting gradients pull the model in opposing directions, degrading performance. A common strategy to mitigate this issue is to group data based on semantic or embedding similarity. However, this fails to capture how data influences model parameters during learning. While recent works have attempted to cluster gradients directly, they randomly project gradients into lower dimensions to manage memory, which leads to accuracy loss. Moreover, these methods rely on expert ensembles which necessitates multiple inference passes and expensive on-the-fly gradient computations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Intelligent Tutoring Systems and Adaptive Learning
