Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments
Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel, L\'azaro-Gredilla

TL;DR
This paper introduces a transformer-based model with discrete bottlenecks that learns interpretable cognitive maps from observations in partially observed environments, enabling efficient path planning and superior in-context accuracy.
Contribution
The paper presents TDB, a transformer with discrete bottlenecks that learns explicit cognitive maps for planning in partially observed environments, outperforming vanilla transformers in speed and interpretability.
Findings
TDB retains high predictive performance while enabling fast path planning.
TDB extracts interpretable cognitive maps from text datasets.
TDB achieves near-perfect in-context accuracy and effective path planning in POEs.
Abstract
Despite their stellar performance on a wide range of tasks, including in-context tasks only revealed during inference, vanilla transformers and variants trained for next-token predictions (a) do not learn an explicit world model of their environment which can be flexibly queried and (b) cannot be used for planning or navigation. In this paper, we consider partially observed environments (POEs), where an agent receives perceptually aliased observations as it navigates, which makes path planning hard. We introduce a transformer with (multiple) discrete bottleneck(s), TDB, whose latent codes learn a compressed representation of the history of observations and actions. After training a TDB to predict the future observation(s) given the history, we extract interpretable cognitive maps of the environment from its active bottleneck(s) indices. These maps are then paired with an external solver…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Advanced Image and Video Retrieval Techniques · Cognitive Computing and Networks
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
