Mixture of Horizons in Action Chunking

Dong Jing; Gang Wang; Jiaqi Liu; Weiliang Tang; Zelong Sun; Yunchao Yao; Zhenyu Wei; Yunhui Liu; Zhiwu Lu; Mingyu Ding

arXiv:2511.19433·cs.RO·November 25, 2025

Mixture of Horizons in Action Chunking

Dong Jing, Gang Wang, Jiaqi Liu, Weiliang Tang, Zelong Sun, Yunchao Yao, Zhenyu Wei, Yunhui Liu, Zhiwu Lu, Mingyu Ding

PDF

Open Access 2 Models

TL;DR

This paper introduces a mixture of horizons (MoH) strategy for vision-language-action models in robotic manipulation, combining different action chunk lengths to improve both local control and long-term foresight, leading to better performance and efficiency.

Contribution

The paper proposes a novel MoH approach that processes multiple action horizons in parallel, enabling dynamic horizon selection and joint exploitation of short-term and long-term information.

Findings

01

MoH improves performance on complex tasks in simulation and real-world.

02

MoH achieves 2.5x higher throughput with maintained accuracy.

03

State-of-the-art success rate of 99% on LIBERO with fewer training iterations.

Abstract

Vision-language-action (VLA) models have shown remarkable capabilities in robotic manipulation, but their performance is sensitive to the $action chunk length$ used during training, termed $horizon$ . Our empirical study reveals an inherent trade-off: longer horizons provide stronger global foresight but degrade fine-grained accuracy, while shorter ones sharpen local control yet struggle on long-term tasks, implying fixed choice of single horizons being suboptimal. To mitigate the trade-off, we propose a $mixture of horizons (MoH)$ strategy. MoH rearranges the action chunk into several segments with different horizons, processes them in parallel with a shared action transformer, and fuses outputs with a light linear gate. It has three appealing benefits. 1) MoH exploits long-term foresight and short-term precision jointly within a single model, improving both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Advanced Neural Network Applications