Rethinking Guidance Information to Utilize Unlabeled Samples:A Label   Encoding Perspective

Yulong Zhang; Yuan Yao; Shuhao Chen; Pengrong Jin; Yu Zhang; Jian Jin,; Jiangang Lu

arXiv:2406.02862·cs.CV·June 6, 2024·1 cites

Rethinking Guidance Information to Utilize Unlabeled Samples:A Label Encoding Perspective

Yulong Zhang, Yuan Yao, Shuhao Chen, Pengrong Jin, Yu Zhang, Jian Jin,, Jiangang Lu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Label-Encoding Risk Minimization (LERM), a novel approach that improves learning from unlabeled data by ensuring both discriminability and diversity of predictions, outperforming existing methods in label-scarce scenarios.

Contribution

LERM is a new method that estimates label encodings from unlabeled data and aligns them with true labels, enhancing learning robustness with limited labels.

Findings

01

LERM outperforms existing methods in label-insufficient scenarios.

02

Theoretical analysis links LERM with ERM and EntMin.

03

LERM effectively balances prediction discriminability and diversity.

Abstract

Empirical Risk Minimization (ERM) is fragile in scenarios with insufficient labeled samples. A vanilla extension of ERM to unlabeled samples is Entropy Minimization (EntMin), which employs the soft-labels of unlabeled samples to guide their learning. However, EntMin emphasizes prediction discriminability while neglecting prediction diversity. To alleviate this issue, in this paper, we rethink the guidance information to utilize unlabeled samples. By analyzing the learning objective of ERM, we find that the guidance information for labeled samples in a specific category is the corresponding label encoding. Inspired by this finding, we propose a Label-Encoding Risk Minimization (LERM). It first estimates the label encodings through prediction means of unlabeled samples and then aligns them with their corresponding ground-truth label encodings. As a result, the LERM ensures both prediction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangyl660/lerm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Software Engineering Research · Intelligent Tutoring Systems and Adaptive Learning