Next state prediction gives rise to entangled, yet compositional   representations of objects

Tankred Saanum; Luca M. Schulze Buschoff; Peter Dayan; Eric Schulz

arXiv:2410.04940·cs.LG·October 8, 2024

Next state prediction gives rise to entangled, yet compositional representations of objects

Tankred Saanum, Luca M. Schulze Buschoff, Peter Dayan, Eric Schulz

PDF

Open Access

TL;DR

This paper investigates whether distributed neural representations, trained on videos, can develop linearly separable object encodings comparable to slot-based models, revealing that such models can support effective compositional generalization without explicit priors.

Contribution

It demonstrates that distributed models can learn linearly separable object representations through unsupervised training, challenging the assumption that object-centric priors are necessary for compositional generalization.

Findings

01

Distributed models often match or outperform slot-based models in prediction tasks.

02

Linearly separable object representations can emerge without explicit object priors.

03

Object representations are partially overlapping but highly separable, aiding generalization.

Abstract

Compositional representations are thought to enable humans to generalize across combinatorially vast state spaces. Models with learnable object slots, which encode information about objects in separate latent codes, have shown promise for this type of generalization but rely on strong architectural priors. Models with distributed representations, on the other hand, use overlapping, potentially entangled neural codes, and their ability to support compositional generalization remains underexplored. In this paper we examine whether distributed models can develop linearly separable representations of objects, like slotted models, through unsupervised training on videos of object interactions. We show that, surprisingly, models with distributed representations often match or outperform models with object slots in downstream prediction tasks. Furthermore, we find that linearly separable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics