Posterior Consistency of Semi-Supervised Regression on Graphs
Andrea L. Bertozzi, Bamdad Hosseini, Hao Li, Kevin Miller, Andrew M., Stuart

TL;DR
This paper investigates the consistency of semi-supervised regression on graphs using a Bayesian approach, providing theoretical bounds and insights into hyperparameter selection for well-clustered graphs with small label noise.
Contribution
It introduces a Bayesian formulation of SSR with a graph Laplacian prior and analyzes the posterior contraction rates in relation to label noise and clustering.
Findings
Derived bounds on posterior contraction rates.
Numerical experiments validating theoretical results.
Insights into hyperparameter tuning for graph-based SSR.
Abstract
Graph-based semi-supervised regression (SSR) is the problem of estimating the value of a function on a weighted graph from its values (labels) on a small subset of the vertices. This paper is concerned with the consistency of SSR in the context of classification, in the setting where the labels have small noise and the underlying graph weighting is consistent with well-clustered nodes. We present a Bayesian formulation of SSR in which the weighted graph defines a Gaussian prior, using a graph Laplacian, and the labeled data defines a likelihood. We analyze the rate of contraction of the posterior measure around the ground truth in terms of parameters that quantify the small label error and inherent clustering in the graph. We obtain bounds on the rates of contraction and illustrate their sharpness through numerical experiments. The analysis also gives insight into the choice of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
