Decomposing Task Vectors for Refined Model Editing
Hamed Damirchi, Ehsan Abbasnejad, Zhen Zhang, Javen Shi

TL;DR
This paper introduces a decomposition method for task vectors in large models, enabling precise control over concept-specific behaviors and improving multi-task merging, style mixing, and toxicity reduction.
Contribution
It proposes a novel decomposition technique that separates shared and unique knowledge in task vectors, enhancing model editing capabilities.
Findings
Improved multi-task merging in image classification by 5%.
Enabled clean style mixing in diffusion models without degradation.
Achieved 47% reduction in toxicity in language models.
Abstract
Large pre-trained models have transformed machine learning, yet adapting these models effectively to exhibit precise, concept-specific behaviors remains a significant challenge. Task vectors, defined as the difference between fine-tuned and pre-trained model parameters, provide a mechanism for steering neural networks toward desired behaviors. This has given rise to large repositories dedicated to task vectors tailored for specific behaviors. The arithmetic operation of these task vectors allows for the seamless combination of desired behaviors without the need for large datasets. However, these vectors often contain overlapping concepts that can interfere with each other during arithmetic operations, leading to unpredictable outcomes. We propose a principled decomposition method that separates each task vector into two components: one capturing shared knowledge across multiple task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning
