Transformer-Based Source-Free Domain Adaptation

Guanglei Yang; Hao Tang; Zhun Zhong; Mingli Ding; Ling Shao; Nicu; Sebe; Elisa Ricci

arXiv:2105.14138·cs.CV·June 1, 2021·21 cites

Transformer-Based Source-Free Domain Adaptation

Guanglei Yang, Hao Tang, Zhun Zhong, Mingli Ding, Ling Shao, Nicu, Sebe, Elisa Ricci

PDF

Open Access 1 Repo

TL;DR

This paper introduces TransDA, a Transformer-based framework for source-free domain adaptation that enhances model focus on object regions, improving generalization and achieving state-of-the-art results across various adaptation tasks.

Contribution

The paper proposes a novel Transformer-based approach with self-supervised knowledge distillation for improved source-free domain adaptation.

Findings

01

TransDA significantly outperforms existing methods in accuracy.

02

Attention focusing on object regions improves domain adaptation.

03

State-of-the-art results on multiple domain adaptation benchmarks.

Abstract

In this paper, we study the task of source-free domain adaptation (SFDA), where the source data are not available during target adaptation. Previous works on SFDA mainly focus on aligning the cross-domain distributions. However, they ignore the generalization ability of the pretrained source model, which largely influences the initial target outputs that are vital to the target adaptation stage. To address this, we make the interesting observation that the model accuracy is highly correlated with whether or not attention is focused on the objects in an image. To this end, we propose a generic and effective framework based on Transformer, named TransDA, for learning a generalized model for SFDA. Specifically, we apply the Transformer as the attention module and inject it into a convolutional network. By doing so, the model is encouraged to turn attention towards the object regions, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ygjwd12345/TransDA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Knowledge Distillation · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Byte Pair Encoding · Residual Connection