Label Distribution Shift-Aware Prediction Refinement for Test-Time   Adaptation

Minguk Jang; Hye Won Chung

arXiv:2411.15204·cs.LG·February 6, 2025

Label Distribution Shift-Aware Prediction Refinement for Test-Time Adaptation

Minguk Jang, Hye Won Chung

PDF

Open Access

TL;DR

This paper introduces DART, a novel test-time adaptation method that refines predictions by focusing on class-wise confusion patterns, significantly improving accuracy under label distribution shifts without degrading performance otherwise.

Contribution

DART is the first method to explicitly model and correct class-wise confusion patterns during test-time adaptation, enhancing robustness to label distribution shifts.

Findings

01

DART achieves 5-18% accuracy gains on CIFAR-10C.

02

DART improves performance on multiple benchmarks including CIFAR, PACS, OfficeHome, and ImageNet.

03

DART enhances existing TTA methods by correcting inaccurate predictions caused by distribution shifts.

Abstract

Test-time adaptation (TTA) is an effective approach to mitigate performance degradation of trained models when encountering input distribution shifts at test time. However, existing TTA methods often suffer significant performance drops when facing additional class distribution shifts. We first analyze TTA methods under label distribution shifts and identify the presence of class-wise confusion patterns commonly observed across different covariate shifts. Based on this observation, we introduce label Distribution shift-Aware prediction Refinement for Test-time adaptation (DART), a novel TTA method that refines the predictions by focusing on class-wise confusion patterns. DART trains a prediction refinement module during an intermediate time by exposing it to several batches with diverse class distributions using the training dataset. This module is then used during test time to detect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Real-time simulation and control systems · Educational Technology and Assessment

MethodsDifficulty-Aware Rejection Tuning