Adaptive Action Chunking at Inference-time for Vision-Language-Action Models

Yuanchang Liang; Xiaobo Wang; Kai Wang; Shuo Wang; Xiaojiang Peng; Haoyu Chen; David Kim Huat Chua; Prahlad Vadakkepat

arXiv:2604.04161·cs.RO·April 13, 2026

Adaptive Action Chunking at Inference-time for Vision-Language-Action Models

Yuanchang Liang, Xiaobo Wang, Kai Wang, Shuo Wang, Xiaojiang Peng, Haoyu Chen, David Kim Huat Chua, Prahlad Vadakkepat

PDF

1 Repo

TL;DR

This paper introduces an adaptive action chunking strategy for vision-language-action models that dynamically adjusts chunk size during inference to balance reactivity and consistency in robotic manipulation.

Contribution

The paper proposes a novel AAC method that uses action entropy to adaptively select chunk sizes, improving performance over fixed-length approaches.

Findings

01

Significant performance improvements on diverse robotic tasks.

02

Effective balance between responsiveness and stability achieved.

03

Source code and videos publicly available.

Abstract

In Vision-Language-Action (VLA) models, action chunking (i.e., executing a sequence of actions without intermediate replanning) is a key technique to improve robotic manipulation abilities. However, a large chunk size reduces the model's responsiveness to new information, while a small one increases the likelihood of mode-jumping, jerky behavior resulting from discontinuities between chunks. Therefore, selecting the optimal chunk size is an urgent demand to balance the model's reactivity and consistency. Unfortunately, a dominant trend in current VLA models is an empirical fixed chunk length at inference-time, hindering their superiority and scalability across diverse manipulation tasks. To address this issue, we propose a novel Adaptive Action Chunking (AAC) strategy, which exploits action entropy as the cue to adaptively determine the chunk size based on current predictions. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://lance-lot.github.io/adaptive-chunking.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.