MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins

Tiberiu Sosea; Cornelia Caragea

arXiv:2308.09037·cs.CV·August 21, 2023·1 cites

MarginMatch: Improving Semi-Supervised Learning with Pseudo-Margins

Tiberiu Sosea, Cornelia Caragea

PDF

Open Access 1 Repo

TL;DR

MarginMatch is a semi-supervised learning method that enhances pseudo-label quality by analyzing training dynamics, leading to significant improvements on vision benchmarks with limited labeled data.

Contribution

It introduces a novel pseudo-labeling approach that uses training dynamics to improve pseudo-label quality in semi-supervised learning.

Findings

01

3.25% error rate reduction on CIFAR-100 with 25 labels per class

02

3.78% error rate reduction on STL-10 with 4 labels per class

03

Substantial improvements on multiple vision benchmarks

Abstract

We introduce MarginMatch, a new SSL approach combining consistency regularization and pseudo-labeling, with its main novelty arising from the use of unlabeled data training dynamics to measure pseudo-label quality. Instead of using only the model's confidence on an unlabeled example at an arbitrary iteration to decide if the example should be masked or not, MarginMatch also analyzes the behavior of the model on the pseudo-labeled examples as the training progresses, to ensure low quality predictions are masked out. MarginMatch brings substantial improvements on four vision benchmarks in low data regimes and on two large-scale datasets, emphasizing the importance of enforcing high-quality pseudo-labels. Notably, we obtain an improvement in error rate over the state-of-the-art of 3.25% on CIFAR-100 with only 25 labels per class and of 3.78% on STL-10 using as few as 4 labels per class. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tsosea2/marginmatch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications