UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation

Chaoqun Du; Yulin Wang; Jiayi Guo; Yizeng Han; Jie Zhou; Gao Huang

arXiv:2407.20080·cs.CV·July 30, 2024

TL;DR

UniTTA introduces a comprehensive benchmark and a versatile framework for realistic test-time adaptation, addressing diverse domain and class distribution challenges with state-of-the-art results.

Contribution

The paper provides the first unified TTA benchmark, covering 36 scenarios, and a framework built on two components: a Balanced Domain Normalization (BDN) layer and a COrrelated Feature Adaptation (COFA) method.

Findings

01. The UniTTA framework outperforms existing methods on the UniTTA benchmark.

02. The benchmark covers 36 realistic TTA scenarios.

03. The proposed methods achieve state-of-the-art results.

Abstract

Test-Time Adaptation (TTA) aims to adapt pre-trained models to the target domain during testing. In reality, this adaptability can be influenced by multiple factors. Researchers have identified various challenging scenarios and developed diverse methods to address these challenges, such as dealing with continual domain shifts, mixed domains, and temporally correlated or imbalanced class distributions. Despite these efforts, a unified and comprehensive benchmark has yet to be established. To this end, we propose a Unified Test-Time Adaptation (UniTTA) benchmark, which is comprehensive and widely applicable. Each scenario within the benchmark is fully described by a Markov state transition matrix for sampling from the original dataset. The UniTTA benchmark considers both domain and class as two independent dimensions of data and addresses various combinations of imbalance/balance and…
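The abstract states that each scenario is described by a Markov state transition matrix used to sample from the original dataset. The sketch below illustrates that general idea only; it is not the authors' implementation. It assumes the states are class labels (they could equally be domains), and the names `sample_state_sequence` and `sticky_transition` are hypothetical helpers introduced for illustration. A matrix with identical uniform rows yields an i.i.d.-like stream, while a matrix with a heavy diagonal yields long runs of the same state, i.e. a temporally correlated stream.

```python
import numpy as np


def sample_state_sequence(transition, length, rng, start=None):
    """Draw a state sequence (e.g. class labels) from a Markov chain.

    transition[i, j] is the probability of moving from state i to state j.
    """
    transition = np.asarray(transition, dtype=float)
    n_states = transition.shape[0]
    if transition.shape != (n_states, n_states):
        raise ValueError("transition matrix must be square")
    if not np.allclose(transition.sum(axis=1), 1.0):
        raise ValueError("each row of the transition matrix must sum to 1")

    # Pick a uniformly random start state unless one is given.
    state = int(rng.integers(n_states)) if start is None else int(start)
    states = np.empty(length, dtype=int)
    for t in range(length):
        states[t] = state
        # The next state depends only on the current state's row.
        state = int(rng.choice(n_states, p=transition[state]))
    return states


def sticky_transition(n_states, stay_prob):
    """Hypothetical helper: stay with probability stay_prob,
    otherwise move uniformly to one of the other states."""
    off_diagonal = (1.0 - stay_prob) / (n_states - 1)
    matrix = np.full((n_states, n_states), off_diagonal)
    np.fill_diagonal(matrix, stay_prob)
    return matrix


rng = np.random.default_rng(0)
iid_like = np.full((4, 4), 0.25)          # next state independent of current
correlated = sticky_transition(4, 0.95)   # long runs of the same state

seq_iid = sample_state_sequence(iid_like, 1000, rng)
seq_cor = sample_state_sequence(correlated, 1000, rng)
```

In a data pipeline built on this sketch, each sampled state would then be used to draw one example with that label from the dataset; how the benchmark itself performs that step is not specified in the text above.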

Code & Models

Repositories

leaplabthu/unitta (PyTorch, official)

Taxonomy

Topics: Software System Performance and Reliability · Software Testing and Debugging Techniques · Educational Technology and Assessment