Out-of-Distribution Generalization in Kernel Regression

Abdulkadir Canatar; Blake Bordelon; Cengiz Pehlevan

arXiv:2106.02261·stat.ML·February 8, 2022·1 cites

Out-of-Distribution Generalization in Kernel Regression

Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper develops an analytical framework using statistical physics to understand and optimize out-of-distribution generalization in kernel regression, applicable to various kernels and datasets.

Contribution

It introduces a replica-based analytical formula for out-of-distribution error in kernel regression and identifies distribution mismatch as a key factor affecting generalization.

Findings

01

Analytical expression for OOD generalization error derived

02

Mismatch quantified by an overlap matrix impacts performance

03

Optimization procedures for training and test distributions developed

Abstract

In real word applications, data generating process for training a machine learning model often differs from what the model encounters in the test stage. Understanding how and whether machine learning models generalize under such distributional shifts have been a theoretical challenge. Here, we study generalization in kernel regression when the training and test distributions are different using methods from statistical physics. Using the replica method, we derive an analytical formula for the out-of-distribution generalization error applicable to any kernel and real datasets. We identify an overlap matrix that quantifies the mismatch between distributions for a given kernel as a key determinant of generalization performance under distribution shift. Using our analytical expressions we elucidate various generalization phenomena including possible improvement in generalization when there…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pehlevan-group/kernel-ood-generalization
noneOfficial

Videos

Out-of-Distribution Generalization in Kernel Regression· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Neural Networks and Applications · Gaussian Processes and Bayesian Inference

MethodsLinear Regression