CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

Wenxuan Song; Han Zhao; Fuhao Li; Ziyang Zhou; Xi Wang; Jing Lyu; Pengxiang Ding; Yan Wang; Donglin Wang; Haoang Li

arXiv:2605.10903·cs.CV·May 12, 2026

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

Wenxuan Song, Han Zhao, Fuhao Li, Ziyang Zhou, Xi Wang, Jing Lyu, Pengxiang Ding, Yan Wang, Donglin Wang, Haoang Li

PDF

1 Repo 1 Models

TL;DR

This paper introduces CapVector, a method to learn transferable capability vectors in parametric space for vision-language-action models, enhancing performance and adaptability with reduced computational costs.

Contribution

It decouples auxiliary training objectives in parameter space, enabling efficient transfer of capabilities and improved generalization in vision-language-action models.

Findings

01

Capability vectors are effective across diverse models.

02

Merged models perform comparably to auxiliary finetuned baselines.

03

Vectors generalize well to new environments and embodiments.

Abstract

This paper proposes a novel approach to address the challenge that pretrained VLA models often fail to effectively improve performance and reduce adaptation costs during standard supervised finetuning (SFT). Some advanced finetuning methods with auxiliary training objectives can improve performance and reduce the number of convergence steps. However, they typically incur significant computational overhead due to the additional losses from auxiliary objectives. To simultaneously achieve the enhanced capabilities of auxiliary training with the simplicity of standard SFT, we decouple the two objectives of auxiliary-objective SFT within the parameter space, namely, enhancing general capabilities and fitting task-specific action distributions. To deliver the goal, we only need to train the model to converge on a small-scale task set using two distinct training strategies, resulting in two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

openhelix-team/CapVector
github

Models

🤗
haofuly/capvector_models_collection
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.