Path Integration and Object-Location Binding Emerge in an Action-Conditioned Predictive Sequence Network

Linda Ariel Ventura; Victoria Bosch; Tim C Kietzmann; Sushrut Thorat

arXiv:2602.03490 · cs.LG · May 11, 2026

TL;DR

This study shows how a recurrent neural network trained on sequence prediction can develop structured, flexible representations of objects and their relations, supporting in-context learning and dynamic binding.

Contribution

The paper provides a mechanistic account of how structured, flexible object representations and path integration emerge in an action-conditioned predictive sequence network.

Findings

01. On novel scenes, prediction accuracy improves over the course of a sequence, indicating in-context learning.

02. Decoding analyses reveal the emergence of path integration and of dynamic binding of token identity to position.

03. New bindings can be learned late in a sequence, and out-of-distribution bindings can be learned as well.
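
To make the link between path integration, binding, and improving prediction concrete, here is a minimal sketch of a hand-written "ideal observer". It is not the paper's network or analysis. The function `run_observer`, the toy three-token scene, and the use of integer displacements are all assumptions made for illustration. The observer sums displacements to track its position, stores each token under the position where it was seen, and predicts the upcoming token only when the landing position has been visited before.

```python
# Hypothetical ideal-observer sketch: path integration plus a position-to-token memory.
# Nothing here is taken from the paper's code; names and the toy scene are illustrative.

def run_observer(steps):
    """steps: list of (current_token, (dx, dy)) pairs; returns one prediction per step.

    A prediction is the token remembered at the landing position, or None if that
    position has not been visited yet in this sequence.
    """
    position = (0, 0)   # position estimate relative to the first fixation
    memory = {}         # binding store: position -> token identity
    predictions = []
    for token, (dx, dy) in steps:
        memory[position] = token                          # bind current token to current position
        position = (position[0] + dx, position[1] + dy)   # path integration of the displacement
        predictions.append(memory.get(position))          # recall the binding at the target, if any
    return predictions


# Toy scene with three tokens: A at (0, 0), B at (2, 0), C at (2, 1).
steps = [
    ("A", (2, 0)),    # A -> B, B not seen yet
    ("B", (0, 1)),    # B -> C, C not seen yet
    ("C", (-2, -1)),  # C -> A, A is remembered
    ("A", (2, 1)),    # A -> C, C is remembered
    ("C", (0, -1)),   # C -> B, B is remembered
]
predictions = run_observer(steps)
print(predictions)  # [None, None, 'A', 'C', 'B']
```

In this toy run the first two steps cannot be predicted, while every later step can. This is the qualitative pattern one would expect if prediction relies on bindings accumulated within the sequence.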

Abstract

Adaptive cognition requires structured internal models of objects and their relations. Predictive neural networks are often proposed to learn such world models, but how these are instantiated and how they support prediction remain unclear. We investigate this in a minimal in-silico setting. A recurrent neural network samples tokens sequentially from 2D continuous token scenes and is trained to predict the upcoming token from the current input and a saccade-like displacement. On novel scenes, prediction accuracy improves across the sequence, indicating in-context learning. Decoding analyses reveal path integration and dynamic binding of token identity to position. Interventional analyses show that new bindings can be learned late in sequence and that out-of-distribution bindings can be learned as well. Together, these findings show how structured representations relying on flexible…
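
The training setup described in the abstract can be sketched as a data generator that emits (current token, displacement, upcoming token) triples. This is a rough illustration under stated assumptions, not the authors' pipeline. The helper names `make_scene` and `sample_sequence`, the unit-square layout, the integer token vocabulary, and the uniform choice of the next fixation target are all hypothetical.

```python
import random

# Hypothetical data-generation sketch for an action-conditioned prediction task.
# Scene layout, vocabulary, and sampling policy are assumptions, not the paper's.

def make_scene(num_tokens, vocab_size, rng):
    """Place num_tokens tokens at random continuous positions in the unit square."""
    return [
        {"token": rng.randrange(vocab_size), "pos": (rng.random(), rng.random())}
        for _ in range(num_tokens)
    ]


def sample_sequence(scene, length, rng):
    """Return (current_token, displacement, target_token) triples for one scene."""
    current = rng.randrange(len(scene))
    triples = []
    for _ in range(length):
        # Pick a different token as the next fixation target (needs >= 2 tokens).
        others = [i for i in range(len(scene)) if i != current]
        nxt = rng.choice(others)
        (x0, y0), (x1, y1) = scene[current]["pos"], scene[nxt]["pos"]
        displacement = (x1 - x0, y1 - y0)  # saccade-like move from current to target
        triples.append((scene[current]["token"], displacement, scene[nxt]["token"]))
        current = nxt
    return triples


rng = random.Random(0)
scene = make_scene(num_tokens=4, vocab_size=10, rng=rng)
triples = sample_sequence(scene, length=8, rng=rng)
for current_token, displacement, target_token in triples:
    print(current_token, displacement, target_token)
```

A sequence model trained on such triples would receive the current token and the displacement as input and the target token as the label. Because each scene is freshly sampled, the target can only be predicted from what was observed earlier in the same sequence.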
