Editing Models with Task Arithmetic

Gabriel Ilharco; Marco Tulio Ribeiro; Mitchell Wortsman; Suchin Gururangan; Ludwig Schmidt; Hannaneh Hajishirzi; Ali Farhadi

arXiv:2212.04089 · cs.LG · April 3, 2023 · 31 cites

TL;DR

This paper introduces task vectors as a way to steer pre-trained models' behavior through arithmetic operations in weight space, enabling task-specific improvements and multi-task performance without additional training.

Contribution

The work proposes a novel paradigm of task vectors for model editing, demonstrating their effectiveness through arithmetic operations like addition and negation to modify model behavior.

Findings

01. Negating a task vector reduces performance on the target task.

02. Adding task vectors together can improve performance on several tasks at once.

03. Combining task vectors according to an analogy between tasks can improve performance on the related task.

Abstract

Changing how pre-trained models behave (e.g., improving their performance on a downstream task or mitigating biases learned during pre-training) is a common practice when developing machine learning systems. In this work, we propose a new paradigm for steering the behavior of neural networks, centered around task vectors. A task vector specifies a direction in the weight space of a pre-trained model, such that movement in that direction improves performance on the task. We build task vectors by subtracting the weights of a pre-trained model from the weights of the same model after fine-tuning on a task. We show that these task vectors can be modified and combined together through arithmetic operations such as negation and addition, and the behavior of the resulting model is steered accordingly. Negating a task vector decreases performance on the target task, with little…
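The arithmetic described in the abstract and findings can be illustrated with a minimal sketch. It is not the authors' implementation: the `Weights` type, the helper names (`task_vector`, `scale`, `add`, `apply`), the `coef` scaling parameter, and the toy numbers are all assumptions made for illustration, with plain Python lists standing in for real model parameters.

```python
from typing import Dict, List

# Hypothetical stand-in for a model's parameters: name -> flat list of floats.
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Fine-tuned weights minus pre-trained weights, parameter by parameter."""
    return {
        name: [f - p for f, p in zip(finetuned[name], pretrained[name])]
        for name in pretrained
    }


def scale(tau: Weights, coef: float) -> Weights:
    """Multiply every entry of a task vector by coef (coef = -1 negates it)."""
    return {name: [coef * v for v in values] for name, values in tau.items()}


def add(*taus: Weights) -> Weights:
    """Element-wise sum of several task vectors with identical shapes."""
    return {
        name: [sum(column) for column in zip(*(t[name] for t in taus))]
        for name in taus[0]
    }


def apply(pretrained: Weights, tau: Weights, coef: float = 1.0) -> Weights:
    """Move the pre-trained weights along tau; coef is an illustrative knob."""
    return {
        name: [p + coef * t for p, t in zip(pretrained[name], tau[name])]
        for name in pretrained
    }


# Toy weights (made up): one pre-trained model and three fine-tuned copies.
pre = {"w": [1.0, 2.0], "b": [0.5]}
ft_a = {"w": [1.5, 2.0], "b": [0.0]}
ft_b = {"w": [1.0, 3.0], "b": [0.5]}
ft_c = {"w": [1.0, 2.0], "b": [1.5]}

tau_a = task_vector(pre, ft_a)
tau_b = task_vector(pre, ft_b)
tau_c = task_vector(pre, ft_c)

# Negation: step away from task A.
forget_a = apply(pre, scale(tau_a, -1.0))

# Addition: combine tasks A and B in one set of weights.
multi = apply(pre, add(tau_a, tau_b))

# Analogy "A is to B as C is to D": estimate a vector for D from the other three.
tau_d = add(tau_c, tau_b, scale(tau_a, -1.0))
```

With real models the same operations would run over each tensor in the model's parameters instead of Python lists, and whether the edited weights behave as intended has to be checked empirically on the tasks of interest.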

Code & Models

Videos

Editing models with task arithmetic · SlidesLive

Taxonomy

Topics: Machine Learning and Data Classification · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)