Next state prediction gives rise to entangled, yet compositional representations of objects
Tankred Saanum, Luca M. Schulze Buschoff, Peter Dayan, Eric Schulz

TL;DR
This paper investigates whether distributed neural representations, trained on videos, can develop linearly separable object encodings comparable to slot-based models, revealing that such models can support effective compositional generalization without explicit priors.
Contribution
It demonstrates that distributed models can learn linearly separable object representations through unsupervised training, challenging the assumption that object-centric priors are necessary for compositional generalization.
Findings
Distributed models often match or outperform slot-based models in prediction tasks.
Linearly separable object representations can emerge without explicit object priors.
Object representations are partially overlapping but highly separable, aiding generalization.
Abstract
Compositional representations are thought to enable humans to generalize across combinatorially vast state spaces. Models with learnable object slots, which encode information about objects in separate latent codes, have shown promise for this type of generalization but rely on strong architectural priors. Models with distributed representations, on the other hand, use overlapping, potentially entangled neural codes, and their ability to support compositional generalization remains underexplored. In this paper we examine whether distributed models can develop linearly separable representations of objects, like slotted models, through unsupervised training on videos of object interactions. We show that, surprisingly, models with distributed representations often match or outperform models with object slots in downstream prediction tasks. Furthermore, we find that linearly separable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics
