Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation

Yangyang Li; Xuanting Hao; Ronghua Shang; Licheng Jiao

arXiv:2411.08756·cs.CV·November 15, 2024

Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation

Yangyang Li, Xuanting Hao, Ronghua Shang, Licheng Jiao

PDF

Open Access

TL;DR

This paper introduces a novel class-wise masked image modeling approach that enhances semi-supervised semantic segmentation by improving intra-class feature connections and regularization, achieving state-of-the-art results.

Contribution

The paper proposes a class-wise masked image modeling method with feature aggregation for semi-supervised segmentation, extending masked image modeling to better utilize unlabeled data.

Findings

01

Achieves state-of-the-art performance on benchmark datasets.

02

Effectively mitigates semantic confusion through class-wise reconstruction.

03

Enhances regularization in semantic space.

Abstract

In view of the fact that semi- and self-supervised learning share a fundamental principle, effectively modeling knowledge from unlabeled data, various semi-supervised semantic segmentation methods have integrated representative self-supervised learning paradigms for further regularization. However, the potential of the state-of-the-art generative self-supervised paradigm, masked image modeling, has been scarcely studied. This paradigm learns the knowledge through establishing connections between the masked and visible parts of masked image, during the pixel reconstruction process. By inheriting and extending this insight, we successfully leverage masked image modeling to boost semi-supervised semantic segmentation. Specifically, we introduce a novel class-wise masked image modeling that independently reconstructs different image regions according to their respective classes. In this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques