Semi-Supervised learning with Density-Ratio Estimation
Masanori Kawakita, Takafumi Kanamori

TL;DR
This paper introduces a semi-supervised learning method using density-ratio estimation that improves prediction accuracy without requiring well-specified probabilistic models, especially under model misspecification.
Contribution
The paper proposes a novel density-ratio estimator for semi-supervised learning that enhances accuracy without needing strict probabilistic assumptions.
Findings
The proposed estimator outperforms supervised learning in accuracy.
It does not require well-specified probabilistic models.
Numerical experiments confirm the method's effectiveness.
Abstract
In this paper, we study statistical properties of semi-supervised learning, which is considered as an important problem in the community of machine learning. In the standard supervised learning, only the labeled data is observed. The classification and regression problems are formalized as the supervised learning. In semi-supervised learning, unlabeled data is also obtained in addition to labeled data. Hence, exploiting unlabeled data is important to improve the prediction accuracy in semi-supervised learning. This problems is regarded as a semiparametric estimation problem with missing data. Under the the discriminative probabilistic models, it had been considered that the unlabeled data is useless to improve the estimation accuracy. Recently, it was revealed that the weighted estimator using the unlabeled data achieves better prediction accuracy in comparison to the learning method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Data Classification · Machine Learning and Algorithms
