An introduction to domain adaptation and transfer learning
Wouter M. Kouw, Marco Loog

TL;DR
This paper introduces the concepts of domain adaptation and transfer learning, explaining their importance in handling distribution shifts between training and test data, and categorizing various approaches to address these challenges.
Contribution
It provides a comprehensive overview of domain adaptation and transfer learning, including theoretical foundations, special cases of data shift, and diverse methodological approaches.
Findings
Categorizes approaches into importance-weighting, subspace mapping, and more.
Highlights the complexity and open questions in practical application.
Explains different types of data set shift like prior, covariate, and concept shift.
Abstract
In machine learning, if the training data is an unbiased sample of an underlying distribution, then the learned classification function will make accurate predictions for new samples. However, if the training data is not an unbiased sample, then there will be differences between how the training data is distributed and how the test data is distributed. Standard classifiers cannot cope with changes in data distributions between training and test phases, and will not perform well. Domain adaptation and transfer learning are sub-fields within machine learning that are concerned with accounting for these types of changes. Here, we present an introduction to these fields, guided by the question: when and how can a classifier generalize from a source to a target domain? We will start with a brief introduction into risk minimization, and how transfer learning and domain adaptation expand upon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Text and Document Classification Technologies
