Sample Complexity of Nonparametric Semi-Supervised Learning
Chen Dan, Liu Leqi, Bryon Aragam, Pradeep Ravikumar, Eric P. Xing

TL;DR
This paper analyzes the sample complexity of nonparametric semi-supervised learning, establishing bounds on labeled samples needed for multiclass classification without parametric assumptions, and introduces algorithms based on bipartite graph matching.
Contribution
It introduces new assumptions based on mixture model mismatch, derives labeled sample complexity bounds for multiclass SSL, and proposes algorithms with experimental validation.
Findings
Labeled sample complexity bound of $oldsymbol{ ext{Ω}(K ext{ log } K)}$ for multiclass SSL.
Near-optimal classifiers can be learned with few labeled samples in nonparametric settings.
Algorithms based on bipartite graph matching outperform majority vote in experiments.
Abstract
We study the sample complexity of semi-supervised learning (SSL) and introduce new assumptions based on the mismatch between a mixture model learned from unlabeled data and the true mixture model induced by the (unknown) class conditional distributions. Under these assumptions, we establish an labeled sample complexity bound without imposing parametric assumptions, where is the number of classes. Our results suggest that even in nonparametric settings it is possible to learn a near-optimal classifier using only a few labeled samples. Unlike previous theoretical work which focuses on binary classification, we consider general multiclass classification (), which requires solving a difficult permutation learning problem. This permutation defines a classifier whose classification error is controlled by the Wasserstein distance between mixing measures, and we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Bayesian Methods and Mixture Models
