MapFormer: Self-Supervised Learning of Cognitive Maps with Input-Dependent Positional Embeddings
Victor Rambaud, Salvador Mascarenhas, Yair Lakretz

TL;DR
MapFormer introduces Transformer-based models that learn cognitive maps from data using input-dependent positional embeddings, enabling superior out-of-distribution generalization and scalable performance on formal and naturalistic tasks.
Contribution
The paper presents novel MapFormers with input-dependent positional encodings that unify absolute and relative positioning, improving cognitive map learning and OOD generalization.
Findings
MapFormers outperform existing models on formal cognitive tasks.
They achieve near-perfect OOD generalization where standard models fail.
Perplexity improvements on naturalistic data indicate scalability.
Abstract
A cognitive map is an internal model which encodes the abstract relationships among entities in the world, giving humans and animals the flexibility to adapt to new situations, with a strong out-of-distribution (OOD) generalization that current AI systems still do not possess. To bridge this gap, we introduce , new Transformer-based architectures, which can learn cognitive maps from observational data and perform path-integration without supervision. Cognitive maps are learned in the model by disentangling structural relationships in the inputs from their specific content, a property that can be achieved by updating position encodings with input-dependent matrices, built as exponentials of learned combinations of Lie-algebra generators. We developed two variants of that unify absolute and relative positional encoding to model episodic (EM) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
