Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models
Youwei Liu, Jian Wang, Hanlin Wang, Beichen Guo, Wenjie Li

TL;DR
This paper introduces Imagine-then-Plan (ITP), a framework enabling agents to perform multi-step lookahead planning with adaptive horizons using learned world models, significantly improving reasoning and performance in complex tasks.
Contribution
The paper proposes a novel adaptive lookahead mechanism within a unified framework for agent learning, enhancing multi-step planning with dynamic horizon adjustment.
Findings
ITP outperforms baseline methods on various benchmarks.
Adaptive lookahead improves agents' reasoning capabilities.
The framework is effective for complex, real-world tasks.
Abstract
Recent advances in world models have shown promise for modeling future dynamics of environmental states, enabling agents to reason and act without accessing real environments. Current methods mainly perform single-step or fixed-horizon rollouts, leaving their potential for complex task planning under-exploited. We propose Imagine-then-Plan (\texttt{ITP}), a unified framework for agent learning via lookahead imagination, where an agent's policy model interacts with the learned world model, yielding multi-step ``imagined'' trajectories. Since the imagination horizon may vary by tasks and stages, we introduce a novel adaptive lookahead mechanism by trading off the ultimate goal and task progress. The resulting imagined trajectories provide rich signals about future consequences, such as achieved progress and potential conflicts, which are fused with current observations, formulating a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
