Mathematical Foundations of Graph-Based Bayesian Semi-Supervised   Learning

Nicolas Garc\'ia Trillos; Daniel Sanz-Alonso; Ruiyi Yang

arXiv:2207.01093·stat.ML·July 5, 2022

Mathematical Foundations of Graph-Based Bayesian Semi-Supervised Learning

Nicolas Garc\'ia Trillos, Daniel Sanz-Alonso, Ruiyi Yang

PDF

Open Access

TL;DR

This paper reviews recent mathematical and statistical advances in graph-based Bayesian semi-supervised learning, emphasizing label propagation techniques and their theoretical foundations.

Contribution

It provides a detailed overview of mathematical tools and ideas underlying the statistical accuracy and computational efficiency of graph-based Bayesian SSL.

Findings

01

Mathematical frameworks for label propagation

02

Analysis of statistical accuracy in SSL

03

Computational methods for Bayesian SSL

Abstract

In recent decades, science and engineering have been revolutionized by a momentous growth in the amount of available data. However, despite the unprecedented ease with which data are now collected and stored, labeling data by supplementing each feature with an informative tag remains to be challenging. Illustrative tasks where the labeling process requires expert knowledge or is tedious and time-consuming include labeling X-rays with a diagnosis, protein sequences with a protein type, texts by their topic, tweets by their sentiment, or videos by their genre. In these and numerous other examples, only a few features may be manually labeled due to cost and time constraints. How can we best propagate label information from a small number of expensive labeled features to a vast number of unlabeled ones? This is the question addressed by semi-supervised learning (SSL). This article…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning in Bioinformatics · Machine Learning and Data Classification