Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference
Deqian Kong, Dehong Xu, Minglu Zhao, Bo Pang, Jianwen Xie, Andrew Lizarraga, Yuhao Huang, Sirui Xie, Ying Nian Wu

TL;DR
This paper introduces the Latent Plan Transformer (LPT), a novel generative model that uses latent variables for trajectory planning, enabling better decision-making and adaptation in reinforcement learning tasks without relying on step-wise rewards.
Contribution
The paper proposes LPT, a new model that leverages latent variables and Transformer architecture to improve long-term planning from offline datasets, addressing temporal consistency challenges.
Findings
LPT achieves competitive performance on multiple benchmarks.
LPT demonstrates improved decision-making from sub-optimal trajectories.
LPT effectively handles trajectory stitching and environmental adaptation.
Abstract
In tasks aiming for long-term returns, planning becomes essential. We study generative modeling for planning with datasets repurposed from offline reinforcement learning. Specifically, we identify temporal consistency in the absence of step-wise rewards as one key technical challenge. We introduce the Latent Plan Transformer (LPT), a novel model that leverages a latent variable to connect a Transformer-based trajectory generator and the final return. LPT can be learned with maximum likelihood estimation on trajectory-return pairs. In learning, posterior sampling of the latent variable naturally integrates sub-trajectories to form a consistent abstraction despite the finite context. At test time, the latent variable is inferred from an expected return before policy execution, realizing the idea of planning as inference. Our experiments demonstrate that LPT can discover improved decisions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI-based Problem Solving and Planning
MethodsAttention Is All You Need · Residual Connection · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Softmax · Absolute Position Encodings · Linear Layer
