CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation
Tongkun Xu, Weihua Chen, Pichao Wang, Fan Wang, Hao Li, Rong Jin

TL;DR
CDTrans introduces a pure transformer-based approach for unsupervised domain adaptation, utilizing a novel labeling algorithm and a triple-branch framework to improve domain alignment and feature learning.
Contribution
This work is the first to apply a pure transformer architecture to UDA, combining self-attention and cross-attention with a center-aware pseudo labeling method.
Findings
Achieves state-of-the-art results on VisDA-2017 and DomainNet datasets.
Outperforms CNN-based UDA methods in domain alignment accuracy.
Demonstrates robustness of transformer-based models in noisy pseudo label scenarios.
Abstract
Unsupervised domain adaptation (UDA) aims to transfer knowledge learned from a labeled source domain to a different unlabeled target domain. Most existing UDA methods focus on learning domain-invariant feature representation, either from the domain level or category level, using convolution neural networks (CNNs)-based frameworks. One fundamental problem for the category level based UDA is the production of pseudo labels for samples in target domain, which are usually too noisy for accurate domain alignment, inevitably compromising the UDA performance. With the success of Transformer in various tasks, we find that the cross-attention in Transformer is robust to the noisy input pairs for better feature alignment, thus in this paper Transformer is adopted for the challenging UDA task. Specifically, to generate accurate input pairs, we design a two-way center-aware labeling algorithm to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Label Smoothing · Residual Connection · Adam
