Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment

Kazuhiko Kawamoto; Atsuhiro Endo; Hiroshi Kera

arXiv:2505.12021·cs.CV·May 20, 2025

Cross-Model Transfer of Task Vectors via Few-Shot Orthogonal Alignment

Kazuhiko Kawamoto, Atsuhiro Endo, Hiroshi Kera

PDF

Open Access 1 Repo

TL;DR

This paper introduces a few-shot orthogonal alignment method that enables effective transfer of task vectors between independently pre-trained models, improving transfer accuracy while preserving modularity.

Contribution

The paper proposes a novel orthogonal alignment technique for cross-model task vector transfer, addressing limitations of traditional task arithmetic in independently pre-trained models.

Findings

01

Improved transfer accuracy over direct task vector application.

02

Achieved performance comparable to few-shot fine-tuning.

03

Maintained modularity and reusability of task vectors.

Abstract

Task arithmetic enables efficient model editing by representing task-specific changes as vectors in parameter space. Task arithmetic typically assumes that the source and target models are initialized from the same pre-trained parameters. This assumption limits its applicability in cross-model transfer settings, where models are independently pre-trained on different datasets. To address this challenge, we propose a method based on few-shot orthogonal alignment, which aligns task vectors to the parameter space of a differently pre-trained target model. These transformations preserve key properties of task vectors, such as norm and rank, and are learned using only a small number of labeled examples. We evaluate the method using two Vision Transformers pre-trained on YFCC100M and LAION400M, and test on eight classification datasets. Experimental results show that our method improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kawakera-lab/crossmodeltransfer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis