KL Guided Domain Adaptation
A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, At{\i}l{\i}m, G\"une\c{s} Baydin

TL;DR
This paper introduces a theoretically grounded, efficient domain adaptation method that minimizes reverse KL divergence between source and target representations, improving generalization without complex adversarial training.
Contribution
It derives a new generalization bound based on reverse KL divergence and proposes a practical, stable algorithm that outperforms existing marginal alignment techniques.
Findings
Outperforms other representation-alignment methods in experiments
Efficient estimation of KL divergence via minibatch samples
No need for additional networks or adversarial objectives
Abstract
Domain adaptation is an important problem and often needed for real-world applications. In this problem, instead of i.i.d. training and testing datapoints, we assume that the source (training) data and the target (testing) data have different distributions. With that setting, the empirical risk minimization training procedure often does not perform well, since it does not account for the change in the distribution. A common approach in the domain adaptation literature is to learn a representation of the input that has the same (marginal) distribution over the source and the target domain. However, these approaches often require additional networks and/or optimizing an adversarial (minimax) objective, which can be very expensive or unstable in practice. To improve upon these marginal alignment techniques, in this paper, we first derive a generalization bound for the target loss based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Speech Recognition and Synthesis
