AllMatch: Exploiting All Unlabeled Data for Semi-Supervised Learning
Zhiyu Wu, Jinshi Cui

TL;DR
AllMatch introduces a novel semi-supervised learning method that adaptively utilizes all unlabeled data by class-specific thresholds and consistency regulation, leading to improved accuracy and full data utilization.
Contribution
The paper proposes a class-specific adaptive threshold mechanism and a binary classification consistency regulation to enhance semi-supervised learning, achieving full unlabeled data utilization.
Findings
Achieves 100% unlabeled data utilization.
Outperforms state-of-the-art methods on multiple benchmarks.
Improves pseudo-label accuracy across various settings.
Abstract
Existing semi-supervised learning algorithms adopt pseudo-labeling and consistency regulation techniques to introduce supervision signals for unlabeled samples. To overcome the inherent limitation of threshold-based pseudo-labeling, prior studies have attempted to align the confidence threshold with the evolving learning status of the model, which is estimated through the predictions made on the unlabeled data. In this paper, we further reveal that classifier weights can reflect the differentiated learning status across categories and consequently propose a class-specific adaptive threshold mechanism. Additionally, considering that even the optimal threshold scheme cannot resolve the problem of discarding unlabeled samples, a binary classification consistency regulation approach is designed to distinguish candidate classes from negative options for all unlabeled samples. By combining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsALIGN
