Large Data and Zero Noise Limits of Graph-Based Semi-Supervised Learning   Algorithms

Matthew M. Dunlop; Dejan Slep\v{c}ev; Andrew M. Stuart; Matthew Thorpe

arXiv:1805.09450·stat.ML·January 1, 2019

Large Data and Zero Noise Limits of Graph-Based Semi-Supervised Learning Algorithms

Matthew M. Dunlop, Dejan Slep\v{c}ev, Andrew M. Stuart, Matthew Thorpe

PDF

TL;DR

This paper investigates the large data and zero noise limits of graph-based semi-supervised learning algorithms, analyzing their continuum analogues and conditions for well-defined limits using advanced mathematical tools.

Contribution

It introduces a framework for understanding the continuum limits of semi-supervised learning algorithms based on graph Laplacians, including both optimization and Bayesian approaches.

Findings

01

Conditions for well-defined continuum limits are identified.

02

Large graph limits are analyzed via $$-convergence and $TL^p$ metric.

03

Small noise limits of Bayesian methods are characterized.

Abstract

Scalings in which the graph Laplacian approaches a differential operator in the large graph limit are used to develop understanding of a number of algorithms for semi-supervised learning; in particular the extension, to this graph setting, of the probit algorithm, level set and kriging methods, are studied. Both optimization and Bayesian approaches are considered, based around a regularizing quadratic form found from an affine transformation of the Laplacian, raised to a, possibly fractional, exponent. Conditions on the parameters defining this quadratic form are identified under which well-defined limiting continuum analogues of the optimization and Bayesian semi-supervised learning problems may be found, thereby shedding light on the design of algorithms in the large graph setting. The large graph limits of the optimization formulations are tackled through $Γ -$ convergence, using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.