Understanding and Enforcing Weight Disentanglement in Task Arithmetic

Shangge Liu; Yuehan Yin; Lei Wang; Qi Fan; Yinghuan Shi; Wenbin Li; Yang Gao; Dacheng Tao

arXiv:2604.17078·cs.AI·April 21, 2026

Understanding and Enforcing Weight Disentanglement in Task Arithmetic

Shangge Liu, Yuehan Yin, Lei Wang, Qi Fan, Yinghuan Shi, Wenbin Li, Yang Gao, Dacheng Tao

PDF

1 Repo 3 Models

TL;DR

This paper introduces TFS as a fundamental principle for weight disentanglement in task arithmetic, and proposes OrthoReg, a regularization method enforcing orthogonality to improve model editing.

Contribution

It establishes TFS as the core cause of weight disentanglement and orthogonality, and develops OrthoReg to promote these properties during fine-tuning.

Findings

01

OrthoReg significantly improves task arithmetic performance.

02

TFS is a sufficient condition for weight disentanglement.

03

Orthogonality of weight vectors correlates with disentanglement.

Abstract

Task arithmetic provides an efficient, training-free way to edit pre-trained models, yet lacks a fundamental theoretical explanation for its success. The existing concept of ``weight disentanglement" describes the ideal outcome of non-interfering task composition but does not reveal its underlying cause. Crucially, what intrinsic properties of the pre-trained model ( $θ_{0}$ ) or the task vectors ( $τ_{t}$ ) enable this disentanglement remains underexplored. In this paper, we introduce Task-Feature Specialization (TFS), a model's ability to allocate distinct internal features to different tasks, as the fundamental principle. We first prove that TFS is a sufficient condition for weight disentanglement. More importantly, we find that TFS also gives rise to an observable geometric consequence: weight vector orthogonality. This positions TFS as the common cause for both the desired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RL-MIND/OrthoReg
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.