TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic   Segmentation

Zhaoyuan Yin; Pichao Wang; Fan Wang; Xianzhe Xu; Hanling Zhang; Hao; Li; Rong Jin

arXiv:2112.01515·cs.CV·July 26, 2022·5 cites

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao, Li, Rong Jin

PDF

Open Access 1 Repo

TL;DR

TransFGU introduces a novel top-down unsupervised semantic segmentation framework that leverages high-level semantic concepts from large-scale data to achieve fine-grained segmentation in complex scenes, outperforming existing bottom-up methods.

Contribution

It is the first to propose a top-down approach for unsupervised semantic segmentation, utilizing high-level semantic priors to improve segmentation accuracy in complicated scenarios.

Findings

01

Outperforms state-of-the-art bottom-up methods on multiple benchmarks.

02

Robust to both object-centric and scene-centric datasets.

03

Effective at different levels of semantic granularity.

Abstract

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their visual cues or certain predefined rules. As a result, it is difficult for these bottom-up approaches to generate fine-grained semantic segmentation when coming to complicated scenes with multiple objects and some objects sharing similar visual appearance. In contrast, we propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios. Specifically, we first obtain rich high-level structured semantic concept information from large-scale vision data in a self-supervised learning manner, and use such information as a prior to discover potential semantic categories presented in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

damo-cv/transfgu
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques