Reiterative Domain Aware Multi-Target Adaptation
Sudipan Saha, Shan Zhao, Nasrullah Sheikh, Xiao Xiang Zhu

TL;DR
This paper introduces a reiterative, multi-target domain adaptation method using Transformers and a dual-classifier approach, which sequentially adapts to multiple unlabeled target domains and aggregates knowledge effectively.
Contribution
It proposes a novel reiterative adaptation strategy with a Transformer backbone and dual classifiers, including a graph neural network, for improved multi-target domain adaptation.
Findings
Achieved 10.7% average improvement on Office-Home dataset.
Significant performance gains over existing methods on multiple datasets.
Effective domain aggregation through confidence-based pseudo-labeling.
Abstract
Most domain adaptation methods focus on single-source-single-target adaptation settings. Multi-target domain adaptation is a powerful extension in which a single classifier is learned for multiple unlabeled target domains. To build a multi-target classifier, it is important to have: a feature extractor that generalizes well across domains; and effective aggregation of features from the labeled source and different unlabeled target domains. Towards the first, we use the recently popular Transformer as a feature extraction backbone. Towards the second, we use a co-teaching-based approach using a dual-classifier head, one of which is based on the graph neural network. The proposed approach uses a sequential adaptation strategy that adapts one domain at a time starting from the target domains that are more similar to the source, assuming that the network finds it easier to adapt to such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsMulti-Head Attention · Linear Layer · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Dense Connections · Byte Pair Encoding · Residual Connection · Dropout · Position-Wise Feed-Forward Layer
