Kernel Interpolation as a Bayes Point Machine

Jeremy Bernstein; Alex Farhang; Yisong Yue

arXiv:2110.04274·cs.LG·January 31, 2022·1 cites

Kernel Interpolation as a Bayes Point Machine

Jeremy Bernstein, Alex Farhang, Yisong Yue

PDF

Open Access 1 Repo

TL;DR

This paper reveals that kernel interpolation functions as a Bayes point machine, enabling new theoretical insights into generalization in neural networks through ensemble and convex geometry theories.

Contribution

It establishes that kernel interpolation is a Bayes point machine for Gaussian process classification, linking ensemble theory and convex geometry to neural network generalization.

Findings

01

Kernel interpolation acts as a Bayes point machine.

02

Large margin neural networks behave like Bayes point machines.

03

Derived PAC-Bayes risk bounds for kernel interpolation.

Abstract

A Bayes point machine is a single classifier that approximates the majority decision of an ensemble of classifiers. This paper observes that kernel interpolation is a Bayes point machine for Gaussian process classification. This observation facilitates the transfer of results from both ensemble theory as well as an area of convex geometry known as Brunn-Minkowski theory to derive PAC-Bayes risk bounds for kernel interpolation. Since large margin, infinite width neural networks are kernel interpolators, the paper's findings may help to explain generalisation in neural networks more broadly. Supporting this idea, the paper finds evidence that large margin, finite width neural networks behave like Bayes point machines too.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jxbz/implicit-bias
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification

MethodsGaussian Process