Not All Unlabeled Data are Equal: Learning to Weight Data in   Semi-supervised Learning

Zhongzheng Ren; Raymond A. Yeh; Alexander G. Schwing

arXiv:2007.01293·cs.LG·October 30, 2020·20 cites

Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning

Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel semi-supervised learning approach that assigns individual weights to unlabeled data points using influence functions, improving performance over existing methods.

Contribution

It proposes a new method to dynamically weight unlabeled examples in SSL using influence functions, replacing manual tuning of weights.

Findings

01

Outperforms state-of-the-art SSL methods on image classification tasks.

02

Efficient approximation of influence functions enables scalable training.

03

Improves model performance by selectively weighting unlabeled data.

Abstract

Existing semi-supervised learning (SSL) algorithms use a single weight to balance the loss of labeled and unlabeled examples, i.e., all unlabeled examples are equally weighted. But not all unlabeled data are equal. In this paper we study how to use a different weight for every unlabeled example. Manual tuning of all those weights -- as done in prior work -- is no longer possible. Instead, we adjust those weights via an algorithm based on the influence function, a measure of a model's dependency on one training example. To make the approach efficient, we propose a fast and effective approximation of the influence function. We demonstrate that this technique outperforms state-of-the-art methods on semi-supervised image and language classification tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Multimodal Machine Learning Applications