DPI: Exploiting Parameter Heterogeneity for Interference-Free Fine-Tuning

Xiaoyu Liu; Xiaoyu Guan; Di Liang; Xianjie Wu

arXiv:2601.17777·cs.CL·January 27, 2026

DPI: Exploiting Parameter Heterogeneity for Interference-Free Fine-Tuning

Xiaoyu Liu, Xiaoyu Guan, Di Liang, Xianjie Wu

PDF

Open Access

TL;DR

This paper introduces DPI, a method that isolates task-specific parameters during fine-tuning of large language models to prevent interference and improve performance across multiple tasks.

Contribution

DPI proposes a novel parameter isolation approach that identifies and freezes core parameters per task, reducing cross-task interference during fine-tuning.

Findings

01

Consistently reduces data conflicts in multi-task fine-tuning

02

Achieves performance improvements over baseline methods

03

Effectively isolates task-specific parameter regions

Abstract

Supervised fine-tuning (SFT) is a crucial step for adapting large language models (LLMs) to downstream tasks. However, conflicting objectives across heterogeneous SFT tasks often induce the "seesaw effect": optimizing for one task may degrade performance on others, particularly when model parameters are updated indiscriminately. In this paper, we propose a principled approach to disentangle and isolate task-specific parameter regions, motivated by the hypothesis that parameter heterogeneity underlies cross-task interference. Specifically, we first independently fine-tune LLMs on diverse SFT tasks and identify each task's core parameter region as the subset of parameters exhibiting the largest updates. Tasks with highly overlapping core parameter regions are merged for joint training, while disjoint tasks are organized into different stages. During multi-stage SFT, core parameters…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications