Vision Transformer-based Adversarial Domain Adaptation

Yahan Li; Yuan Wu

arXiv:2404.15817·cs.CV·April 25, 2024·1 cites

Vision Transformer-based Adversarial Domain Adaptation

Yahan Li, Yuan Wu

PDF

Open Access 1 Repo

TL;DR

This paper explores using Vision Transformers as feature extractors in adversarial domain adaptation, demonstrating they can replace CNNs to improve performance in unsupervised domain adaptation tasks.

Contribution

It introduces the use of Vision Transformers in adversarial domain adaptation and shows they can be seamlessly integrated to enhance existing methods.

Findings

01

ViT can replace CNNs in UDA methods without modification.

02

Using ViT improves domain adaptation performance.

03

The approach is plug-and-play and easy to implement.

Abstract

Unsupervised domain adaptation (UDA) aims to transfer knowledge from a labeled source domain to an unlabeled target domain. The most recent UDA methods always resort to adversarial training to yield state-of-the-art results and a dominant number of existing UDA methods employ convolutional neural networks (CNNs) as feature extractors to learn domain invariant features. Vision transformer (ViT) has attracted tremendous attention since its emergence and has been widely used in various computer vision tasks, such as image classification, object detection, and semantic segmentation, yet its potential in adversarial domain adaptation has never been investigated. In this paper, we fill this gap by employing the ViT as the feature extractor in adversarial domain adaptation. Moreover, we empirically demonstrate that ViT can be a plug-and-play component in adversarial domain adaptation, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lluckyyh/vt-ada
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Anomaly Detection Techniques and Applications

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Dense Connections · Residual Connection · Softmax · Vision Transformer