Latent Geometry Beyond Search: Amortizing Planning in World Models

Hoang Nguyen; Xiaohao Xu; Xiaonan Huang

arXiv:2605.08732·cs.RO·May 12, 2026

Latent Geometry Beyond Search: Amortizing Planning in World Models

Hoang Nguyen, Xiaohao Xu, Xiaonan Huang

PDF

TL;DR

This paper introduces a method to amortize planning in world models by learning a latent inverse-dynamics model, significantly reducing computation while maintaining or improving performance across various environments.

Contribution

It demonstrates that structured latent spaces enable replacing iterative planning with a learned inverse-dynamics model, simplifying control in vision-based world models.

Findings

01

The proposed GC-IDM matches or exceeds CEM performance in most benchmarks.

02

Per-decision cost is reduced by 100-130x with the new method.

03

The approach is robust across different test-time planners.

Abstract

Modern vision-based world models can represent observations as compact yet expressive latent manifolds, but fast goal-oriented planning in these spaces remains challenging. This raises a central question: when does a learned representation simplify control, rather than merely enabling prediction? We study this question in a pretrained LeWorldModel, whose latent geometry is regularized for smoothness and uniformity. Our key insight is that, under such geometry, planning can be amortized into a latent inverse-dynamics mapping instead of requiring online search. We therefore replace iterative planning with a lightweight Goal-Conditioned Inverse Dynamics Model (GC-IDM) that maps the current latent state, goal latent state, and remaining horizon directly to the next action. Empirically, across four benchmark environments spanning navigation, contact-rich manipulation, and continuous control,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.