MapFormer: Self-Supervised Learning of Cognitive Maps with Input-Dependent Positional Embeddings

Victor Rambaud; Salvador Mascarenhas; Yair Lakretz

arXiv:2511.19279 · cs.LG · May 12, 2026

TL;DR

MapFormer introduces Transformer-based models that learn cognitive maps from data using input-dependent positional embeddings, enabling superior out-of-distribution generalization and scalable performance on formal and naturalistic tasks.

Contribution

The paper presents novel MapFormers with input-dependent positional encodings that unify absolute and relative positioning, improving cognitive map learning and OOD generalization.

Findings

1. MapFormers outperform existing models on formal cognitive tasks.
2. They achieve near-perfect OOD generalization where standard models fail.
3. Perplexity improvements on naturalistic data indicate scalability.

Abstract

A cognitive map is an internal model that encodes the abstract relationships among entities in the world, giving humans and animals the flexibility to adapt to new situations, with strong out-of-distribution (OOD) generalization that current AI systems still do not possess. To bridge this gap, we introduce MapFormers, new Transformer-based architectures that can learn cognitive maps from observational data and perform path integration without supervision. Cognitive maps are learned in the model by disentangling structural relationships in the inputs from their specific content, a property that can be achieved by updating position encodings with input-dependent matrices, built as exponentials of learned combinations of Lie-algebra generators. We developed two variants of MapFormers that unify absolute and relative positional encoding to model episodic (EM) and…
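
The abstract's central mechanism, position updates built as exponentials of combinations of Lie-algebra generators, can be illustrated with a small toy. The following is a minimal sketch under stated assumptions, not the paper's implementation: it assumes commuting 2x2 rotation blocks (so(2) generators), for which the matrix exponential has a closed form, and it uses a hand-written `angle_table` in place of coefficients that the model would learn from data. The function names and the three-token vocabulary are hypothetical.

```python
import numpy as np


def rotation_from_generator(theta):
    """Closed form of exp(theta * J) for the so(2) generator J = [[0, -1], [1, 0]]."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])


def block_rotation(thetas):
    """Block-diagonal matrix holding one 2x2 rotation per coefficient."""
    k = len(thetas)
    out = np.zeros((2 * k, 2 * k))
    for i, theta in enumerate(thetas):
        out[2 * i:2 * i + 2, 2 * i:2 * i + 2] = rotation_from_generator(theta)
    return out


def path_integrate(token_ids, angle_table):
    """Return the running position matrix after each token.

    angle_table[token] holds that token's coefficients on the generators.
    Each token contributes the exponential of its generator combination,
    and the contributions are multiplied cumulatively along the sequence.
    """
    dim = 2 * angle_table.shape[1]
    state = np.eye(dim)
    states = []
    for tok in token_ids:
        state = block_rotation(angle_table[tok]) @ state
        states.append(state)
    return states


# Hypothetical toy vocabulary (an assumption, not from the paper):
# 0 = "step east", 1 = "step west", 2 = an observation that causes no movement.
# In a trained model these coefficients would be learned; here they are fixed by hand.
angle_table = np.array([
    [0.3, 0.7],
    [-0.3, -0.7],
    [0.0, 0.0],
])

# east, observe, east, west, west: the net displacement is zero.
states = path_integrate([0, 2, 0, 1, 1], angle_table)
final_state = states[-1]
returned_home = np.allclose(final_state, np.eye(4))
```

In this toy, the position state depends only on the sequence of movements and not on what was observed, which is one way to read the abstract's separation of structure from content: the observation token leaves the state unchanged, and a path whose steps cancel brings the state back to the identity.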
