Generalization error of min-norm interpolators in transfer learning
Yanke Song, Sohom Bhattacharya, Pragya Sur

TL;DR
This paper analyzes the generalization error of min-norm interpolators in transfer learning with diverse data sources, providing explicit conditions under which transfer learning improves or worsens performance, and proposing data-driven methods for optimal sample selection.
Contribution
It offers a novel theoretical analysis of pooled min-$\
Findings
Transfer learning benefits depend on SNR and shift-to-signal ratio.
Adding data can harm performance under low SNR or certain shifts.
The paper introduces a data-driven method to optimize the number of target samples.
Abstract
This paper establishes the generalization error of pooled min--norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during training. However, in many applications, a limited amount of test data may be available during training, yet properties of min-norm interpolation in this setting are not well-understood. We address this gap by characterizing the bias and variance of pooled min--norm interpolation under covariate and model shifts. The pooled interpolator captures both early fusion and a form of intermediate fusion. Our results have several implications: under model shift, for low signal-to-noise ratio (SNR),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
