TL;DR
This paper introduces TFS as a fundamental principle for weight disentanglement in task arithmetic, and proposes OrthoReg, a regularization method enforcing orthogonality to improve model editing.
Contribution
It establishes TFS as the core cause of weight disentanglement and orthogonality, and develops OrthoReg to promote these properties during fine-tuning.
Findings
OrthoReg significantly improves task arithmetic performance.
TFS is a sufficient condition for weight disentanglement.
Orthogonality of weight vectors correlates with disentanglement.
Abstract
Task arithmetic provides an efficient, training-free way to edit pre-trained models, yet lacks a fundamental theoretical explanation for its success. The existing concept of ``weight disentanglement" describes the ideal outcome of non-interfering task composition but does not reveal its underlying cause. Crucially, what intrinsic properties of the pre-trained model () or the task vectors () enable this disentanglement remains underexplored. In this paper, we introduce Task-Feature Specialization (TFS), a model's ability to allocate distinct internal features to different tasks, as the fundamental principle. We first prove that TFS is a sufficient condition for weight disentanglement. More importantly, we find that TFS also gives rise to an observable geometric consequence: weight vector orthogonality. This positions TFS as the common cause for both the desired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
