In-Context Planning with Latent Temporal Abstractions
Baiting Luo, Yunuo Zhang, Nathaniel S. Keplinger, Samir Gupta, Abhishek Dubey, Ayan Mukhopadhyay

TL;DR
I-TAP is an offline RL framework that combines in-context adaptation with online planning by learning a discrete latent space of temporal abstractions, enabling efficient decision-making in complex, partially observable environments.
Contribution
It introduces a novel latent temporal abstraction space learned from offline data, facilitating in-context planning and adaptation without gradient updates during testing.
Findings
Outperforms strong offline baselines in MuJoCo and manipulation tasks.
Handles stochastic dynamics and partial observability effectively.
Enables efficient planning in high-dimensional, real-world scenarios.
Abstract
Planning-based reinforcement learning for continuous control is bottlenecked by two practical issues: planning at primitive time scales leads to prohibitive branching and long horizons, while real environments are frequently partially observable and exhibit regime shifts that invalidate stationary, fully observed dynamics assumptions. We introduce I-TAP (In-Context Latent Temporal-Abstraction Planner), an offline RL framework that unifies in-context adaptation with online planning in a learned discrete temporal-abstraction space. From offline trajectories, I-TAP learns an observation-conditioned residual-quantization VAE that compresses each observation-macro-action segment into a coarse-to-fine stack of discrete residual tokens, and a temporal Transformer that autoregressively predicts these token stacks from a short recent history. The resulting sequence model acts simultaneously as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Robot Manipulation and Learning
