Probability-density-aware Semi-supervised Learning
Shuyang Liu, Ruiqiu Zheng, Yunhang Shen, Ke Li, Xing Sun, Zhou Yu,, Shaohui Lin

TL;DR
This paper introduces a probability-density-aware measure for semi-supervised learning that leverages cluster assumptions more effectively, leading to improved label propagation and better performance.
Contribution
It proposes a novel probability-density-aware measure and an associated label propagation algorithm, providing a theoretical foundation and demonstrating superior empirical results.
Findings
PMLP outperforms recent semi-supervised learning methods.
Pseudo-labeling is a special case of PMLP, explaining its effectiveness.
Theoretical analysis confirms the importance of density in SSL.
Abstract
Semi-supervised learning (SSL) assumes that neighbor points lie in the same category (neighbor assumption), and points in different clusters belong to various categories (cluster assumption). Existing methods usually rely on similarity measures to retrieve the similar neighbor points, ignoring cluster assumption, which may not utilize unlabeled information sufficiently and effectively. This paper first provides a systematical investigation into the significant role of probability density in SSL and lays a solid theoretical foundation for cluster assumption. To this end, we introduce a Probability-Density-Aware Measure (PM) to discern the similarity between neighbor points. To further improve Label Propagation, we also design a Probability-Density-Aware Measure Label Propagation (PMLP) algorithm to fully consider the cluster assumption in label propagation. Last but not least, we prove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research · Machine Learning and Data Classification
