Limits of Model Selection under Transfer Learning
Steve Hanneke, Samory Kpotufe, Yasaman Mahdaviyeh

TL;DR
This paper investigates the fundamental limits of model selection in transfer learning, revealing that adaptive methods can be significantly slower than oracle approaches that utilize distributional distance information.
Contribution
It provides the first theoretical analysis of model selection under transfer learning, highlighting the impact of transfer distance on achievable rates.
Findings
Adaptive rates can be arbitrarily slower than oracle rates.
Transfer distance influences the complexity of model selection.
The study focuses on classification problems.
Abstract
Theoretical studies on transfer learning or domain adaptation have so far focused on situations with a known hypothesis class or model; however in practice, some amount of model selection is usually involved, often appearing under the umbrella term of hyperparameter-tuning: for example, one may think of the problem of tuning for the right neural network architecture towards a target task, while leveraging data from a related source task. Now, in addition to the usual tradeoffs on approximation vs estimation errors involved in model selection, this problem brings in a new complexity term, namely, the transfer distance between source and target distributions, which is known to vary with the choice of hypothesis class. We present a first study of this problem, focusing on classification; in particular, the analysis reveals some remarkable phenomena: adaptive rates, i.e., those…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Speech Recognition and Synthesis
