T2S: Tokenized Skill Scaling for Lifelong Imitation Learning

Hongquan Zhang; Jingyu Gong; Zhizhong Zhang; Xin Tan; Yanyun Qu; Yuan Xie

arXiv:2508.01167·cs.LG·August 5, 2025

T2S: Tokenized Skill Scaling for Lifelong Imitation Learning

Hongquan Zhang, Jingyu Gong, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie

PDF

Open Access

TL;DR

T2S introduces a unified tokenized framework for lifelong imitation learning that effectively prevents forgetting, scales new skills efficiently, and transfers knowledge across tasks with minimal parameter growth.

Contribution

The paper proposes Tokenized Skill Scaling (T2S), a novel transformer-based approach that enhances scalability and knowledge transfer in lifelong imitation learning.

Findings

01

Prevents catastrophic forgetting with 1.0% average NBT.

02

Scales new skills with only 8.0% trainable tokens.

03

Achieves 77.7% average FWT for knowledge transfer.

Abstract

The main challenge in lifelong imitation learning lies in the balance between mitigating catastrophic forgetting of previous skills while maintaining sufficient capacity for acquiring new ones. However, current approaches typically address these aspects in isolation, overlooking their internal correlation in lifelong skill acquisition. We address this limitation with a unified framework named Tokenized Skill Scaling (T2S). Specifically, by tokenizing the model parameters, the linear parameter mapping of the traditional transformer is transformed into cross-attention between input and learnable tokens, thereby enhancing model scalability through the easy extension of new tokens. Additionally, we introduce language-guided skill scaling to transfer knowledge across tasks efficiently and avoid linearly growing parameters. Extensive experiments across diverse tasks demonstrate that T2S: 1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Robot Manipulation and Learning · Multimodal Machine Learning Applications