ReMixMatch: Semi-Supervised Learning with Distribution Alignment and   Augmentation Anchoring

David Berthelot; Nicholas Carlini; Ekin D. Cubuk; Alex Kurakin; Kihyuk; Sohn; Han Zhang; Colin Raffel

arXiv:1911.09785·cs.LG·February 17, 2020·164 cites

ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk, Sohn, Han Zhang, Colin Raffel

PDF

Open Access 3 Repos

TL;DR

ReMixMatch advances semi-supervised learning by integrating distribution alignment and augmentation anchoring, significantly reducing data requirements while maintaining high accuracy, and introduces a learned augmentation policy.

Contribution

It introduces ReMixMatch, a novel semi-supervised learning algorithm that combines distribution alignment and augmentation anchoring with learned strong augmentations.

Findings

01

ReMixMatch achieves high accuracy with 5-16 times less data than prior methods.

02

On CIFAR-10 with 250 labels, it reaches 93.73% accuracy.

03

The method requires only four labels per class for median accuracy.

Abstract

We improve the recently-proposed "MixMatch" semi-supervised learning algorithm by introducing two new techniques: distribution alignment and augmentation anchoring. Distribution alignment encourages the marginal distribution of predictions on unlabeled data to be close to the marginal distribution of ground-truth labels. Augmentation anchoring feeds multiple strongly augmented versions of an input into the model and encourages each output to be close to the prediction for a weakly-augmented version of the same input. To produce strong augmentations, we propose a variant of AutoAugment which learns the augmentation policy while the model is being trained. Our new algorithm, dubbed ReMixMatch, is significantly more data-efficient than prior work, requiring between $5 \times$ and $16 \times$ less data to reach the same accuracy. For example, on CIFAR-10 with 250 labeled examples we reach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · AutoAugment