Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond
Haoxiang Wang, Bo Li, Han Zhao

TL;DR
This paper provides a new, improved theoretical analysis of gradual domain adaptation, reducing the error bound's dependency on the number of intermediate domains from exponential to linear, and suggests optimal strategies for constructing domain paths.
Contribution
It introduces a significantly tighter generalization bound for gradual self-training, relaxing previous assumptions and guiding the optimal design of intermediate domain paths.
Findings
The new bound depends linearly on the number of domains T.
An optimal number of intermediate domains T minimizes the generalization error.
Empirical results validate the theoretical improvements on real datasets.
Abstract
The vast majority of existing algorithms for unsupervised domain adaptation (UDA) focus on adapting from a labeled source domain to an unlabeled target domain directly in a one-off way. Gradual domain adaptation (GDA), on the other hand, assumes a path of unlabeled intermediate domains bridging the source and target, and aims to provide better generalization in the target domain by leveraging the intermediate ones. Under certain assumptions, Kumar et al. (2020) proposed a simple algorithm, Gradual Self-Training, along with a generalization bound in the order of for the target domain error, where is the source domain error and is the data size of each domain. Due to the exponential factor, this upper bound becomes vacuous when is only moderately large. In this work, we analyze gradual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsGradual Self-Training
