Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path   and Beyond

Haoxiang Wang; Bo Li; Han Zhao

arXiv:2204.08200·cs.LG·July 11, 2022·5 cites

Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond

Haoxiang Wang, Bo Li, Han Zhao

PDF

Open Access 3 Repos

TL;DR

This paper provides a new, improved theoretical analysis of gradual domain adaptation, reducing the error bound's dependency on the number of intermediate domains from exponential to linear, and suggests optimal strategies for constructing domain paths.

Contribution

It introduces a significantly tighter generalization bound for gradual self-training, relaxing previous assumptions and guiding the optimal design of intermediate domain paths.

Findings

01

The new bound depends linearly on the number of domains T.

02

An optimal number of intermediate domains T minimizes the generalization error.

03

Empirical results validate the theoretical improvements on real datasets.

Abstract

The vast majority of existing algorithms for unsupervised domain adaptation (UDA) focus on adapting from a labeled source domain to an unlabeled target domain directly in a one-off way. Gradual domain adaptation (GDA), on the other hand, assumes a path of $(T - 1)$ unlabeled intermediate domains bridging the source and target, and aims to provide better generalization in the target domain by leveraging the intermediate ones. Under certain assumptions, Kumar et al. (2020) proposed a simple algorithm, Gradual Self-Training, along with a generalization bound in the order of $e^{O (T)} (ε_{0} + O (l o g (T) / n))$ for the target domain error, where $ε_{0}$ is the source domain error and $n$ is the data size of each domain. Due to the exponential factor, this upper bound becomes vacuous when $T$ is only moderately large. In this work, we analyze gradual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsGradual Self-Training