Transformer-Based Source-Free Domain Adaptation
Guanglei Yang, Hao Tang, Zhun Zhong, Mingli Ding, Ling Shao, Nicu, Sebe, Elisa Ricci

TL;DR
This paper introduces TransDA, a Transformer-based framework for source-free domain adaptation that enhances model focus on object regions, improving generalization and achieving state-of-the-art results across various adaptation tasks.
Contribution
The paper proposes a novel Transformer-based approach with self-supervised knowledge distillation for improved source-free domain adaptation.
Findings
TransDA significantly outperforms existing methods in accuracy.
Attention focusing on object regions improves domain adaptation.
State-of-the-art results on multiple domain adaptation benchmarks.
Abstract
In this paper, we study the task of source-free domain adaptation (SFDA), where the source data are not available during target adaptation. Previous works on SFDA mainly focus on aligning the cross-domain distributions. However, they ignore the generalization ability of the pretrained source model, which largely influences the initial target outputs that are vital to the target adaptation stage. To address this, we make the interesting observation that the model accuracy is highly correlated with whether or not attention is focused on the objects in an image. To this end, we propose a generic and effective framework based on Transformer, named TransDA, for learning a generalized model for SFDA. Specifically, we apply the Transformer as the attention module and inject it into a convolutional network. By doing so, the model is encouraged to turn attention towards the object regions, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Knowledge Distillation · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Byte Pair Encoding · Residual Connection
