Generalization error of min-norm interpolators in transfer learning

Yanke Song; Sohom Bhattacharya; Pragya Sur

arXiv:2406.13944·math.ST·June 21, 2024

Generalization error of min-norm interpolators in transfer learning

Yanke Song, Sohom Bhattacharya, Pragya Sur

PDF

Open Access

TL;DR

This paper analyzes the generalization error of min-norm interpolators in transfer learning with diverse data sources, providing explicit conditions under which transfer learning improves or worsens performance, and proposing data-driven methods for optimal sample selection.

Contribution

It offers a novel theoretical analysis of pooled min-$\

Findings

01

Transfer learning benefits depend on SNR and shift-to-signal ratio.

02

Adding data can harm performance under low SNR or certain shifts.

03

The paper introduces a data-driven method to optimize the number of target samples.

Abstract

This paper establishes the generalization error of pooled min- $ℓ_{2}$ -norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during training. However, in many applications, a limited amount of test data may be available during training, yet properties of min-norm interpolation in this setting are not well-understood. We address this gap by characterizing the bias and variance of pooled min- $ℓ_{2}$ -norm interpolation under covariate and model shifts. The pooled interpolator captures both early fusion and a form of intermediate fusion. Our results have several implications: under model shift, for low signal-to-noise ratio (SNR),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications