Towards Stable Test-Time Adaptation in Dynamic Wild World
Shuaicheng Niu, Jiaxiang Wu, Yifan Zhang, Zhiquan Wen, Yaofo Chen,, Peilin Zhao, Mingkui Tan

TL;DR
This paper addresses the instability of test-time adaptation (TTA) in dynamic environments by identifying batch norm as a key factor and proposing SAR, a method that enhances stability through noise filtering and flat minima optimization.
Contribution
The paper introduces SAR, a novel TTA stabilization technique that removes noisy samples and promotes flat minima, improving robustness in real-world, dynamic test scenarios.
Findings
SAR outperforms prior methods in stability and robustness.
TTA with batch-agnostic norms reduces failure cases.
Removing noisy samples prevents collapse to trivial solutions.
Abstract
Test-time adaptation (TTA) has shown to be effective at tackling distribution shifts between training and testing data by adapting a given model on test samples. However, the online model updating of TTA may be unstable and this is often a key obstacle preventing existing TTA methods from being deployed in the real world. Specifically, TTA may fail to improve or even harm the model performance when test data have: 1) mixed distribution shifts, 2) small batch sizes, and 3) online imbalanced label distribution shifts, which are quite common in practice. In this paper, we investigate the unstable reasons and find that the batch norm layer is a crucial factor hindering TTA stability. Conversely, TTA can perform more stably with batch-agnostic norm layers, \ie, group or layer norm. However, we observe that TTA with group and layer norms does not always succeed and still suffers many failure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Advanced MRI Techniques and Applications
Methodsfail · Test
