CRL-VLA: Continual Vision-Language-Action Learning

Qixin Zeng; Shuo Zhang; Hongyin Zhang; Renjie Wang; Han Zhao; Libang Zhao; Runze Li; Donglin Wang; Chao Huang

arXiv:2602.03445·cs.AI·February 4, 2026

CRL-VLA: Continual Vision-Language-Action Learning

Qixin Zeng, Shuo Zhang, Hongyin Zhang, Renjie Wang, Han Zhao, Libang Zhao, Runze Li, Donglin Wang, Chao Huang

PDF

Open Access

TL;DR

CRL-VLA introduces a continual learning framework for vision-language-action models that balances stability and plasticity, enabling lifelong robotic manipulation with theoretical guarantees and superior performance on benchmarks.

Contribution

It presents a novel continual reinforcement learning method with a dual-critic architecture and theoretical bounds, improving lifelong VLA model adaptation and retention.

Findings

01

Outperforms baselines on LIBERO benchmark

02

Effectively balances stability and plasticity

03

Provides theoretical performance bounds

Abstract

Lifelong learning is critical for embodied agents in open-world environments, where reinforcement learning fine-tuning has emerged as an important paradigm to enable Vision-Language-Action (VLA) models to master dexterous manipulation through environmental interaction. Thus, Continual Reinforcement Learning (CRL) is a promising pathway for deploying VLA models in lifelong robotic scenarios, yet balancing stability (retaining old skills) and plasticity (learning new ones) remains a formidable challenge for existing methods. We introduce CRL-VLA, a framework for continual post-training of VLA models with rigorous theoretical bounds. We derive a unified performance bound linking the stability-plasticity trade-off to goal-conditioned advantage magnitude, scaled by policy divergence. CRL-VLA resolves this dilemma via asymmetric regulation: constraining advantage magnitudes on prior tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning