Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals
Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van, Gool

TL;DR
This paper introduces an unsupervised method for semantic segmentation that learns pixel embeddings through contrastive optimization with a prior, enabling clustering and transfer to new datasets without supervision.
Contribution
It presents a novel two-step contrastive framework with a prior for unsupervised semantic segmentation on challenging datasets, surpassing existing methods.
Findings
Effective pixel embedding clustering on PASCAL dataset
Improved transfer performance on COCO and DAVIS datasets
First unsupervised semantic segmentation on such challenging benchmarks
Abstract
Being able to learn dense semantic representations of images without supervision is an important problem in computer vision. However, despite its significance, this problem remains rather unexplored, with a few exceptions that considered unsupervised semantic segmentation on small-scale datasets with a narrow visual domain. In this paper, we make a first attempt to tackle the problem on datasets that have been traditionally utilized for the supervised case. To achieve this, we introduce a two-step framework that adopts a predetermined mid-level prior in a contrastive optimization objective to learn pixel embeddings. This marks a large deviation from existing works that relied on proxy tasks or end-to-end clustering. Additionally, we argue about the importance of having a prior that contains information about objects, or their parts, and discuss several possibilities to obtain such a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsSpatial Pyramid Pooling · Batch Normalization · 1x1 Convolution · Dilated Convolution · Atrous Spatial Pyramid Pooling · DeepLabv3
