Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference

Deqian Kong; Dehong Xu; Minglu Zhao; Bo Pang; Jianwen Xie; Andrew Lizarraga; Yuhao Huang; Sirui Xie; Ying Nian Wu

arXiv:2402.04647·cs.LG·August 19, 2025·1 cites

Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference

Deqian Kong, Dehong Xu, Minglu Zhao, Bo Pang, Jianwen Xie, Andrew Lizarraga, Yuhao Huang, Sirui Xie, Ying Nian Wu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the Latent Plan Transformer (LPT), a novel generative model that uses latent variables for trajectory planning, enabling better decision-making and adaptation in reinforcement learning tasks without relying on step-wise rewards.

Contribution

The paper proposes LPT, a new model that leverages latent variables and Transformer architecture to improve long-term planning from offline datasets, addressing temporal consistency challenges.

Findings

01

LPT achieves competitive performance on multiple benchmarks.

02

LPT demonstrates improved decision-making from sub-optimal trajectories.

03

LPT effectively handles trajectory stitching and environmental adaptation.

Abstract

In tasks aiming for long-term returns, planning becomes essential. We study generative modeling for planning with datasets repurposed from offline reinforcement learning. Specifically, we identify temporal consistency in the absence of step-wise rewards as one key technical challenge. We introduce the Latent Plan Transformer (LPT), a novel model that leverages a latent variable to connect a Transformer-based trajectory generator and the final return. LPT can be learned with maximum likelihood estimation on trajectory-return pairs. In learning, posterior sampling of the latent variable naturally integrates sub-trajectories to form a consistent abstraction despite the finite context. At test time, the latent variable is inferred from an expected return before policy execution, realizing the idea of planning as inference. Our experiments demonstrate that LPT can discover improved decisions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mingluzhao/latent-plan-transformer
pytorchOfficial

Videos

Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference· slideslive

Taxonomy

TopicsAI-based Problem Solving and Planning

MethodsAttention Is All You Need · Residual Connection · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Softmax · Absolute Position Encodings · Linear Layer