Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning
Lei Zhu, Zhanghan Ke, Rynson Lau

TL;DR
This paper introduces a self-adaptive pseudo-label filtering method for semi-supervised learning that dynamically adjusts to model evolution, improving label quality and SSL performance especially with limited labeled data.
Contribution
The proposed SPF method automatically filters pseudo labels by modeling confidence distributions, eliminating manual tuning and enhancing SSL effectiveness.
Findings
SPF improves SSL accuracy with scarce labeled data.
Dynamic filtering reduces incorrect pseudo labels during training.
SPF outperforms fixed, handcrafted filtering strategies.
Abstract
Recent semi-supervised learning (SSL) methods typically include a filtering strategy to improve the quality of pseudo labels. However, these filtering strategies are usually hand-crafted and do not change as the model is updated, resulting in a lot of correct pseudo labels being discarded and incorrect pseudo labels being selected during the training process. In this work, we observe that the distribution gap between the confidence values of correct and incorrect pseudo labels emerges at the very beginning of the training, which can be utilized to filter pseudo labels. Based on this observation, we propose a Self-Adaptive Pseudo-Label Filter (SPF), which automatically filters noise in pseudo labels in accordance with model evolvement by modeling the confidence distribution throughout the training process. Specifically, with an online mixture model, we weight each pseudo-labeled sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater Systems and Optimization · Text and Document Classification Technologies · Speech and Audio Processing
