Large Data and Zero Noise Limits of Graph-Based Semi-Supervised Learning Algorithms
Matthew M. Dunlop, Dejan Slep\v{c}ev, Andrew M. Stuart, Matthew Thorpe

TL;DR
This paper investigates the large data and zero noise limits of graph-based semi-supervised learning algorithms, analyzing their continuum analogues and conditions for well-defined limits using advanced mathematical tools.
Contribution
It introduces a framework for understanding the continuum limits of semi-supervised learning algorithms based on graph Laplacians, including both optimization and Bayesian approaches.
Findings
Conditions for well-defined continuum limits are identified.
Large graph limits are analyzed via $$-convergence and $TL^p$ metric.
Small noise limits of Bayesian methods are characterized.
Abstract
Scalings in which the graph Laplacian approaches a differential operator in the large graph limit are used to develop understanding of a number of algorithms for semi-supervised learning; in particular the extension, to this graph setting, of the probit algorithm, level set and kriging methods, are studied. Both optimization and Bayesian approaches are considered, based around a regularizing quadratic form found from an affine transformation of the Laplacian, raised to a, possibly fractional, exponent. Conditions on the parameters defining this quadratic form are identified under which well-defined limiting continuum analogues of the optimization and Bayesian semi-supervised learning problems may be found, thereby shedding light on the design of algorithms in the large graph setting. The large graph limits of the optimization formulations are tackled through convergence, using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
