Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao, Vivienne Wang, Juho Kannala, Joni Pajarinen

TL;DR
This paper introduces Grouped Discrete Representation (GDR), a novel approach for Object-Centric Learning that decomposes features into attribute groups and uses tuple code indexes for improved object separation and interpretability.
Contribution
GDR enhances object-centric learning by decomposing features into attribute groups and employing tuple code indexes, improving generalization and convergence.
Findings
GDR improves performance of existing OCL methods across datasets.
GDR enhances object separability and interpretability.
Visualizations show clearer object attribute distinctions.
Abstract
Object-Centric Learning (OCL) aims to discover objects in images or videos by reconstructing the input. Representative methods achieve this by reconstructing the input as its Variational Autoencoder (VAE) discrete representations, which suppress (super-)pixel noise and enhance object separability. However, these methods treat features as indivisible units, overlooking their compositional attributes, and discretize features via scalar code indexes, losing attribute-level similarities and differences. We propose Grouped Discrete Representation (GDR) for OCL. For better generalization, features are decomposed into combinatorial attributes by organized channel grouping. For better convergence, features are quantized into discrete representations via tuple code indexes. Experiments demonstrate that GDR consistently improves both mainstream and state-of-the-art OCL methods across various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
