Mix-and-Match Tuning for Self-Supervised Semantic Segmentation

Xiaohang Zhan; Ziwei Liu; Ping Luo; Xiaoou Tang; Chen Change Loy

arXiv:1712.00661·cs.CV·January 31, 2018·6 cites

Mix-and-Match Tuning for Self-Supervised Semantic Segmentation

Xiaohang Zhan, Ziwei Liu, Ping Luo, Xiaoou Tang, Chen Change Loy

PDF

Open Access

TL;DR

This paper introduces a 'mix-and-match' tuning stage for self-supervised semantic segmentation that significantly improves performance, surpassing fully-supervised pre-training without requiring additional labeled data.

Contribution

It proposes a pluggable M&M tuning method that enhances self-supervised learning by better utilizing limited pixel-wise annotations, achieving state-of-the-art results.

Findings

01

Boosts segmentation performance to surpass supervised pre-training.

02

Effective use of patch mixing and class-wise graph matching.

03

Achieves comparable or better results than ImageNet pre-training.

Abstract

Deep convolutional networks for semantic image segmentation typically require large-scale labeled data, e.g. ImageNet and MS COCO, for network pre-training. To reduce annotation efforts, self-supervised semantic segmentation is recently proposed to pre-train a network without any human-provided labels. The key of this new form of learning is to design a proxy task (e.g. image colorization), from which a discriminative loss can be formulated on unlabeled data. Many proxy tasks, however, lack the critical supervision signals that could induce discriminative representation for the target image segmentation task. Thus self-supervision's performance is still far from that of supervised pre-training. In this study, we overcome this limitation by incorporating a "mix-and-match" (M&M) tuning stage in the self-supervision pipeline. The proposed approach is readily pluggable to many…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications