TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation
Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao, Li, Rong Jin

TL;DR
TransFGU introduces a novel top-down unsupervised semantic segmentation framework that leverages high-level semantic concepts from large-scale data to achieve fine-grained segmentation in complex scenes, outperforming existing bottom-up methods.
Contribution
It is the first to propose a top-down approach for unsupervised semantic segmentation, utilizing high-level semantic priors to improve segmentation accuracy in complicated scenarios.
Findings
Outperforms state-of-the-art bottom-up methods on multiple benchmarks.
Robust to both object-centric and scene-centric datasets.
Effective at different levels of semantic granularity.
Abstract
Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their visual cues or certain predefined rules. As a result, it is difficult for these bottom-up approaches to generate fine-grained semantic segmentation when coming to complicated scenes with multiple objects and some objects sharing similar visual appearance. In contrast, we propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios. Specifically, we first obtain rich high-level structured semantic concept information from large-scale vision data in a self-supervised learning manner, and use such information as a prior to discover potential semantic categories presented in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
