CKT-WAM: Parameter-Efficient Context Knowledge Transfer Between World Action Models

Yuhua Jiang; Yijun Guo; Hongbing Yang; Guojun Lei; Nuo Chen; Yinuo Zhang; Shaoqiang Yan; Bo Lin; Feifei Gao; Biqing Qi

arXiv:2605.06247·cs.RO·May 8, 2026

CKT-WAM: Parameter-Efficient Context Knowledge Transfer Between World Action Models

Yuhua Jiang, Yijun Guo, Hongbing Yang, Guojun Lei, Nuo Chen, Yinuo Zhang, Shaoqiang Yan, Bo Lin, Feifei Gao, Biqing Qi

PDF

1 Repo

TL;DR

CKT-WAM introduces a parameter-efficient framework for transferring knowledge between heterogeneous world action models by embedding context in text space, enhancing zero-shot generalization and real-world manipulation.

Contribution

It proposes a novel context knowledge transfer method that reduces adaptation cost and improves performance with minimal parameter updates.

Findings

01

Achieves 86.1% success rate on LIBERO-Plus with only 1.17% trainable parameters.

02

Demonstrates strong real-world long-horizon manipulation with 83.3% success rate.

03

Outperforms existing methods in zero-shot generalization and multi-task settings.

Abstract

World action models (WAMs) provide a powerful generative framework for embodied control, yet transferring knowledge across heterogeneous WAMs remains challenging due to mismatched latent interfaces, high adaptation cost, and the rigidity of conventional distillation objectives. We propose \textbf{CKT-WAM}, a parameter-efficient \textbf{C}ontext \textbf{K}nowledge \textbf{T}ransfer framework that transfers teacher WAM's knowledge into a student WAM through a compact context in the text embedding space, rather than output imitation or dense hidden-state matching. Specifically, CKT-WAM extracts intermediate teacher hidden states, reduces the number of tokens via compressors' learnable-query cross attention (LQCA), and transforms them through an always-on generalized adapter, a lightweight router, and sparsely activated specialized adapters. The resulting context is then appended to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YuhuaJiang2002/CKT-WAM
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.