Hierarchical Planning with Latent World Models

Wancong Zhang; Basile Terver; Artem Zholus; Soham Chitnis; Harsh Sutaria; Mido Assran; Randall Balestriero; Amir Bar; Adrien Bardes; Yann LeCun; Nicolas Ballas

arXiv:2604.03208·cs.LG·April 6, 2026

Hierarchical Planning with Latent World Models

Wancong Zhang, Basile Terver, Artem Zholus, Soham Chitnis, Harsh Sutaria, Mido Assran, Randall Balestriero, Amir Bar, Adrien Bardes, Yann LeCun, Nicolas Ballas

PDF

TL;DR

This paper introduces a hierarchical planning method with multi-scale latent world models that improves long-horizon control and zero-shot generalization in embodied tasks, reducing planning complexity and increasing success rates.

Contribution

It proposes a novel hierarchical approach that learns latent world models at multiple temporal scales, enabling efficient long-horizon reasoning across diverse domains.

Findings

01

Achieves 70% success rate on real-world pick-and-place tasks with only goal specification.

02

Outperforms single-level models in success rate and planning efficiency.

03

Requires up to 4x less planning time in simulated environments.

Abstract

Model predictive control (MPC) with learned world models has emerged as a promising paradigm for embodied control, particularly for its ability to generalize zero-shot when deployed in new environments. However, learned world models often struggle with long-horizon control due to the accumulation of prediction errors and the exponentially growing search space. In this work, we address these challenges by learning latent world models at multiple temporal scales and performing hierarchical planning across these scales, enabling long-horizon reasoning while substantially reducing inference-time planning complexity. Our approach serves as a modular planning abstraction that applies across diverse latent world-model architectures and domains. We demonstrate that this hierarchical approach enables zero-shot control on real-world non-greedy robotic tasks, achieving a 70% success rate on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.