DPI: Exploiting Parameter Heterogeneity for Interference-Free Fine-Tuning
Xiaoyu Liu, Xiaoyu Guan, Di Liang, Xianjie Wu

TL;DR
This paper introduces DPI, a method that isolates task-specific parameters during fine-tuning of large language models to prevent interference and improve performance across multiple tasks.
Contribution
DPI proposes a novel parameter isolation approach that identifies and freezes core parameters per task, reducing cross-task interference during fine-tuning.
Findings
Consistently reduces data conflicts in multi-task fine-tuning
Achieves performance improvements over baseline methods
Effectively isolates task-specific parameter regions
Abstract
Supervised fine-tuning (SFT) is a crucial step for adapting large language models (LLMs) to downstream tasks. However, conflicting objectives across heterogeneous SFT tasks often induce the "seesaw effect": optimizing for one task may degrade performance on others, particularly when model parameters are updated indiscriminately. In this paper, we propose a principled approach to disentangle and isolate task-specific parameter regions, motivated by the hypothesis that parameter heterogeneity underlies cross-task interference. Specifically, we first independently fine-tune LLMs on diverse SFT tasks and identify each task's core parameter region as the subset of parameters exhibiting the largest updates. Tasks with highly overlapping core parameter regions are merged for joint training, while disjoint tasks are organized into different stages. During multi-stage SFT, core parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
