Approximately Bayes-Optimal Pseudo Label Selection

Julian Rodemann; Jann Goschenhofer; Emilio Dorigatti; Thomas Nagler,; Thomas Augustin

arXiv:2302.08883·stat.ML·June 27, 2023·1 cites

Approximately Bayes-Optimal Pseudo Label Selection

Julian Rodemann, Jann Goschenhofer, Emilio Dorigatti, Thomas Nagler,, Thomas Augustin

PDF

Open Access

TL;DR

This paper introduces BPLS, a Bayesian pseudo-label selection framework that reduces confirmation bias in semi-supervised learning by using an analytically approximated posterior predictive criterion, improving performance on high-dimensional data.

Contribution

The paper develops a Bayesian criterion for pseudo-label selection, proving its Bayes optimality and providing an analytical approximation to enhance semi-supervised learning.

Findings

01

BPLS outperforms traditional methods on high-dimensional data.

02

The Bayesian criterion effectively mitigates confirmation bias.

03

Analytical approximations make the method computationally feasible.

Abstract

Semi-supervised learning by self-training heavily relies on pseudo-label selection (PLS). The selection often depends on the initial model fit on labeled data. Early overfitting might thus be propagated to the final model by selecting instances with overconfident but erroneous predictions, often referred to as confirmation bias. This paper introduces BPLS, a Bayesian framework for PLS that aims to mitigate this issue. At its core lies a criterion for selecting instances to label: an analytical approximation of the posterior predictive of pseudo-samples. We derive this selection criterion by proving Bayes optimality of the posterior predictive of pseudo-samples. We further overcome computational hurdles by approximating the criterion analytically. Its relation to the marginal likelihood allows us to come up with an approximation based on Laplace's method and the Gaussian integral. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification