Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention

Avinash Kori; Francesco Locatello; Ainkaran Santhirasekaram; Francesca Toni; Ben Glocker; Fabio De Sousa Ribeiro

arXiv:2406.07141 · cs.LG · November 12, 2024

TL;DR

This paper introduces a probabilistic slot-attention method that makes object-centric slot representations identifiable without supervision, up to an equivalence relation. The claim is supported by theoretical analysis and by empirical validation on simple 2-dimensional data and high-resolution imaging datasets.

Contribution

It presents a novel probabilistic approach to slot attention that offers theoretical guarantees for object representation identifiability in an unsupervised setting.

Findings

01. Theoretical identifiability guarantees are established for the proposed method, up to an equivalence relation.

02. Empirical validation supports the theoretical result on both simple 2-dimensional data and high-resolution imaging datasets.

03. The method is a step toward scaling slot-based methods to high-dimensional images with correctness guarantees, without supervision.

Abstract

Learning modular object-centric representations is crucial for systematic generalization. Existing methods show promising object-binding capabilities empirically, but theoretical identifiability guarantees remain relatively underdeveloped. Understanding when object-centric representations can theoretically be identified is crucial for scaling slot-based methods to high-dimensional images with correctness guarantees. To that end, we propose a probabilistic slot-attention algorithm that imposes an aggregate mixture prior over object-centric slot representations, thereby providing slot identifiability guarantees without supervision, up to an equivalence relation. We provide empirical verification of our theoretical identifiability result using both simple 2-dimensional data and high-resolution imaging datasets.
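As a rough illustration of what treating slots as components of a mixture can look like, the sketch below runs a generic expectation-maximization (EM) update for a diagonal Gaussian mixture over feature tokens. It is not the authors' algorithm, which this page does not describe in detail; the function name `em_slot_step`, the argument names, the toy data, and the choice of a diagonal covariance are all assumptions made for illustration.

```python
import numpy as np

def em_slot_step(x, mu, var, pi):
    """One EM update of a diagonal Gaussian mixture (hypothetical sketch).

    x:   (N, D) feature tokens
    mu:  (K, D) slot means
    var: (K, D) slot variances (diagonal covariance, an assumption)
    pi:  (K,)   mixing weights
    """
    # E-step: unnormalized log responsibility of slot k for token n.
    sq = (x[:, None, :] - mu[None, :, :]) ** 2 / var[None, :, :]
    log_p = (
        np.log(pi)[None, :]
        - 0.5 * np.sum(np.log(2 * np.pi * var), axis=1)[None, :]
        - 0.5 * np.sum(sq, axis=2)
    )
    # Softmax over slots, shifted by the row maximum for stability.
    log_p -= log_p.max(axis=1, keepdims=True)
    r = np.exp(log_p)
    r /= r.sum(axis=1, keepdims=True)  # (N, K), each row sums to 1

    # M-step: re-estimate slot statistics from the soft assignments.
    nk = r.sum(axis=0) + 1e-8
    mu_new = (r.T @ x) / nk[:, None]
    diff = x[:, None, :] - mu_new[None, :, :]
    var_new = np.einsum("nk,nkd->kd", r, diff ** 2) / nk[:, None] + 1e-6
    pi_new = nk / nk.sum()
    return mu_new, var_new, pi_new, r

# Toy data: two well-separated 2-D clusters standing in for "objects".
rng = np.random.default_rng(0)
x = np.concatenate([
    rng.normal(-3.0, 0.5, (50, 2)),
    rng.normal(3.0, 0.5, (50, 2)),
])
mu = x[rng.choice(len(x), size=2, replace=False)]  # init means from data
var = np.ones((2, 2))
pi = np.full(2, 0.5)
for _ in range(10):
    mu, var, pi, r = em_slot_step(x, mu, var, pi)
```

The responsibilities `r` play the role that attention weights play in standard slot attention: each token is softly assigned across slots, and each slot is then updated from the tokens assigned to it. How the paper aggregates such per-image mixtures into a prior is not covered by this sketch.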

Code & Models

Repositories

koriavinash1/psa (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification