Mixture of Horizons in Action Chunking
Dong Jing, Gang Wang, Jiaqi Liu, Weiliang Tang, Zelong Sun, Yunchao Yao, Zhenyu Wei, Yunhui Liu, Zhiwu Lu, Mingyu Ding

TL;DR
This paper introduces a mixture of horizons (MoH) strategy for vision-language-action models in robotic manipulation, combining different action chunk lengths to improve both local control and long-term foresight, leading to better performance and efficiency.
Contribution
The paper proposes a novel MoH approach that processes multiple action horizons in parallel, enabling dynamic horizon selection and joint exploitation of short-term and long-term information.
Findings
MoH improves performance on complex tasks in simulation and real-world.
MoH achieves 2.5x higher throughput with maintained accuracy.
State-of-the-art success rate of 99% on LIBERO with fewer training iterations.
Abstract
Vision-language-action (VLA) models have shown remarkable capabilities in robotic manipulation, but their performance is sensitive to the used during training, termed . Our empirical study reveals an inherent trade-off: longer horizons provide stronger global foresight but degrade fine-grained accuracy, while shorter ones sharpen local control yet struggle on long-term tasks, implying fixed choice of single horizons being suboptimal. To mitigate the trade-off, we propose a strategy. MoH rearranges the action chunk into several segments with different horizons, processes them in parallel with a shared action transformer, and fuses outputs with a light linear gate. It has three appealing benefits. 1) MoH exploits long-term foresight and short-term precision jointly within a single model, improving both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Advanced Neural Network Applications
