Safe Self-Refinement for Transformer-based Domain Adaptation

Tao Sun; Cheng Lu; Tianshuo Zhang; Haibin Ling

arXiv:2204.07683·cs.CV·April 19, 2022·5 cites

Safe Self-Refinement for Transformer-based Domain Adaptation

Tao Sun, Cheng Lu, Tianshuo Zhang, Haibin Ling

PDF

Open Access 1 Repo

TL;DR

This paper introduces SSRT, a transformer-based approach for unsupervised domain adaptation that employs safe self-refinement to enhance transferability and reduce model collapse, achieving state-of-the-art results on multiple benchmarks.

Contribution

The paper proposes a novel transformer-based UDA method with a safe self-refinement strategy, improving transfer performance and robustness over existing CNN-based methods.

Findings

01

Transformer backbone surpasses CNN on DomainNet.

02

Safe self-refinement reduces model collapse.

03

Achieves top performance on multiple benchmarks.

Abstract

Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich source domain to solve tasks on a related unlabeled target domain. It is a challenging problem especially when a large domain gap lies between the source and target domains. In this paper we propose a novel solution named SSRT (Safe Self-Refinement for Transformer-based domain adaptation), which brings improvement from two aspects. First, encouraged by the success of vision transformers in various vision tasks, we arm SSRT with a transformer backbone. We find that the combination of vision transformer with simple adversarial adaptation surpasses best reported Convolutional Neural Network (CNN)-based results on the challenging DomainNet benchmark, showing its strong transferable feature representation. Second, to reduce the risk of model collapse and improve the effectiveness of knowledge transfer between domains with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tsun/ssrt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Dense Connections · Layer Normalization · Vision Transformer