AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models

Yuhua Jiang; Shuang Cheng; Yan Ding; Feifei Gao; Biqing Qi

arXiv:2511.14148·cs.RO·May 8, 2026

AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models

Yuhua Jiang, Shuang Cheng, Yan Ding, Feifei Gao, Biqing Qi

PDF

1 Repo

TL;DR

AsyncVLA introduces an asynchronous flow matching framework for vision-language-action models, enabling flexible, self-correcting action generation that improves robotic manipulation performance in long-horizon tasks.

Contribution

It proposes a novel asynchronous flow matching approach with self-correction and confidence-based refinement, enhancing VLA models' stability and efficiency.

Findings

01

Outperforms existing methods on robotic benchmarks.

02

Demonstrates data efficiency and self-correction in experiments.

03

Works effectively in both simulation and real-world settings.

Abstract

Vision-language-action (VLA) models have recently emerged as a powerful paradigm for building generalist robots. However, traditional VLA models that generate actions through flow matching (FM) typically rely on rigid and uniform time schedules, i.e., synchronous FM (SFM). Without action context awareness and asynchronous self-correction, SFM becomes unstable in long-horizon tasks, where a single action error can cascade into failure. In this work, we propose asynchronous flow matching VLA (AsyncVLA), a novel framework that introduces temporal flexibility in asynchronous FM (AFM) and enables self-correction in action generation. AsyncVLA breaks from the vanilla SFM in VLA models by generating the action tokens in a non-uniform time schedule with action context awareness. Besides, our method introduces the confidence rater to extract confidence of the initially generated actions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YuhuaJiang2002/AsyncVLA
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.