Training Data Selection with Gradient Orthogonality for Efficient Domain Adaptation
Xiyang Zhang, Yuanhe Tian, Hongzhi Wang, Yan Song

TL;DR
This paper introduces Orthogonal Gradient Selection (OGS), a novel data selection method that improves domain adaptation of large language models by ensuring gradient orthogonality, leading to better domain performance and efficiency without costly gradient projections.
Contribution
OGS shifts gradient safety from the optimizer to data selection using reinforcement learning and a lightweight Navigator, enabling efficient domain adaptation without incurring high computational costs.
Findings
OGS significantly improves domain-specific performance across multiple fields.
OGS maintains or enhances general reasoning capabilities like GSM8K.
OGS reduces training time compared to gradient projection methods.
Abstract
Fine-tuning large language models (LLMs) for specialized domains often necessitates a trade-off between acquiring domain expertise and retaining general reasoning capabilities, a phenomenon known as catastrophic forgetting. Existing remedies face a dichotomy: gradient surgery methods offer geometric safety but incur prohibitive computational costs via online projections, while efficient data selection approaches reduce overhead but remain blind to conflict-inducing gradient directions. In this paper, we propose Orthogonal Gradient Selection (OGS), a data-centric method that harmonizes domain performance, general capability retention, and training efficiency. OGS shifts the geometric insights of gradient projection from the optimizer to the data selection stage by treating data selection as a constrained decision-making process. By leveraging a lightweight Navigator model and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Multimodal Machine Learning Applications
