Large-scale Unsupervised Semantic Segmentation

Shanghua Gao; Zhong-Yu Li; Ming-Hsuan Yang; Ming-Ming Cheng; and Junwei Han; Philip Torr

arXiv:2106.03149·cs.CV·November 4, 2022

Large-scale Unsupervised Semantic Segmentation

Shanghua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, and Junwei Han, Philip Torr

PDF

Open Access 3 Repos 1 Datasets

TL;DR

This paper introduces the large-scale unsupervised semantic segmentation (LUSS) problem, creates a new ImageNet-S dataset for benchmarking, and proposes an effective method to advance research in unsupervised segmentation.

Contribution

It defines the LUSS problem, provides a large-scale benchmark dataset, and offers a simple method that performs well, facilitating progress in unsupervised semantic segmentation.

Findings

01

Created ImageNet-S dataset with 1.2 million images and 50k annotations.

02

Benchmarking of various supervised and unsupervised methods.

03

Identified challenges and future directions for LUSS.

Abstract

Empowered by large datasets, e.g., ImageNet, unsupervised learning on large-scale data has enabled significant advances for classification tasks. However, whether the large-scale unsupervised semantic segmentation can be achieved remains unknown. There are two major challenges: i) we need a large-scale benchmark for assessing algorithms; ii) we need to develop methods to simultaneously learn category and shape representation in an unsupervised manner. In this work, we propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to help the research progress. Building on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 50k high-quality semantic segmentation annotations for evaluation. Our benchmark has a high data diversity and a clear task objective. We also present a simple yet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

braceletboy/imagenet-s
dataset· 127 dl
127 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques