Mix-and-Match Tuning for Self-Supervised Semantic Segmentation
Xiaohang Zhan, Ziwei Liu, Ping Luo, Xiaoou Tang, Chen Change Loy

TL;DR
This paper introduces a 'mix-and-match' tuning stage for self-supervised semantic segmentation that significantly improves performance, surpassing fully-supervised pre-training without requiring additional labeled data.
Contribution
It proposes a pluggable M&M tuning method that enhances self-supervised learning by better utilizing limited pixel-wise annotations, achieving state-of-the-art results.
Findings
Boosts segmentation performance to surpass supervised pre-training.
Effective use of patch mixing and class-wise graph matching.
Achieves comparable or better results than ImageNet pre-training.
Abstract
Deep convolutional networks for semantic image segmentation typically require large-scale labeled data, e.g. ImageNet and MS COCO, for network pre-training. To reduce annotation efforts, self-supervised semantic segmentation is recently proposed to pre-train a network without any human-provided labels. The key of this new form of learning is to design a proxy task (e.g. image colorization), from which a discriminative loss can be formulated on unlabeled data. Many proxy tasks, however, lack the critical supervision signals that could induce discriminative representation for the target image segmentation task. Thus self-supervision's performance is still far from that of supervised pre-training. In this study, we overcome this limitation by incorporating a "mix-and-match" (M&M) tuning stage in the self-supervision pipeline. The proposed approach is readily pluggable to many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
