Slot Attention with Re-Initialization and Self-Distillation
Rongzhen Zhao, Yi Zhao, Juho Kannala, Joni Pajarinen

TL;DR
This paper introduces DIAS, a novel method for Object-Centric Learning that reduces redundancy and improves slot initialization through re-initialization and self-distillation, leading to state-of-the-art results in object discovery and recognition.
Contribution
The paper proposes DIAS, a new approach that enhances Slot Attention by re-initializing redundant slots and applying self-distillation to improve object representation accuracy.
Findings
Achieves state-of-the-art performance on object discovery tasks
Improves object recognition and visual reasoning accuracy
Reduces redundant slots and enhances slot initialization quality
Abstract
Unlike popular solutions based on dense feature maps, Object-Centric Learning (OCL) represents visual scenes as sub-symbolic object-level feature vectors, termed slots, which are highly versatile for tasks involving visual modalities. OCL typically aggregates object superpixels into slots by iteratively applying competitive cross attention, known as Slot Attention, with the slots as the query. However, once initialized, these slots are reused naively, causing redundant slots to compete with informative ones for representing objects. This often results in objects being erroneously segmented into parts. Additionally, mainstream methods derive supervision signals solely from decoding slots into the input's reconstruction, overlooking potential supervision based on internal information. To address these issues, we propose Slot Attention with re-Initialization and self-Distillation (DIAS):…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques
