Exploring the Role of the Bottleneck in Slot-Based Models Through   Covariance Regularization

Andrew Stange; Robert Lo; Abishek Sridhar; Kousik Rajesh

arXiv:2306.02577·cs.CV·June 6, 2023·1 cites

Exploring the Role of the Bottleneck in Slot-Based Models Through Covariance Regularization

Andrew Stange, Robert Lo, Abishek Sridhar, Kousik Rajesh

PDF

Open Access 1 Repo

TL;DR

This paper investigates how constraining the bottleneck in slot-based models with covariance regularization affects their performance, aiming to improve image reconstruction and mask quality on real-world datasets.

Contribution

It introduces a loss-based method to regularize the bottleneck in slot-based models, enabling larger encoders and improving over baseline Slot Attention models.

Findings

01

Improved performance over baseline Slot Attention

02

Feature reconstruction outperforms image reconstruction

03

Bottleneck regularization enhances model capacity

Abstract

In this project we attempt to make slot-based models with an image reconstruction objective competitive with those that use a feature reconstruction objective on real world datasets. We propose a loss-based approach to constricting the bottleneck of slot-based models, allowing larger-capacity encoder networks to be used with Slot Attention without producing degenerate stripe-shaped masks. We find that our proposed method offers an improvement over the baseline Slot Attention model but does not reach the performance of \dinosaur on the COCO2017 dataset. Throughout this project, we confirm the superiority of a feature reconstruction objective over an image reconstruction objective and explore the role of the architectural bottleneck in slot-based models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robert1003/slot-attention-disentanglement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · AI in cancer detection · Medical Image Segmentation Techniques