On Learning Invariant Representation for Domain Adaptation
Han Zhao, Remi Tachet des Combes, Kun Zhang, Geoffrey J. Gordon

TL;DR
This paper challenges the assumption that learning domain-invariant features guarantees successful adaptation, introduces a new theoretical bound considering class-conditional shifts, and empirically validates these insights.
Contribution
It provides a counterexample to common beliefs, proposes a new generalization bound accounting for conditional shift, and establishes a fundamental tradeoff via an information-theoretic lower bound.
Findings
Counterexample shows invariance alone is insufficient for adaptation.
A new bound explicitly considers class-conditional distribution shifts.
Experimental results support the theoretical tradeoff and insights.
Abstract
Due to the ability of deep neural nets to learn rich representations, recent advances in unsupervised domain adaptation have focused on learning domain-invariant features that achieve a small error on the source domain. The hope is that the learnt representation, together with the hypothesis learnt from the source domain, can generalize to the target domain. In this paper, we first construct a simple counterexample showing that, contrary to common belief, the above conditions are not sufficient to guarantee successful domain adaptation. In particular, the counterexample exhibits \emph{conditional shift}: the class-conditional distributions of input features change between source and target domains. To give a sufficient condition for domain adaptation, we propose a natural and interpretable generalization upper bound that explicitly takes into account the aforementioned shift. Moreover,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Multimodal Machine Learning Applications
