Realistic Evaluation of Test-Time Adaptation Algorithms: Unsupervised Hyperparameter Selection
Sebastian Cygert, Damian S\'ojka, Tomasz Trzci\'nski, Bart{\l}omiej, Twardowski

TL;DR
This paper evaluates test-time adaptation algorithms using realistic hyperparameter selection methods, revealing that many recent methods perform worse under these conditions and highlighting the importance of supervision in hyperparameter tuning.
Contribution
It introduces surrogate-based hyperparameter selection strategies for TTA, providing a more realistic evaluation framework and analyzing their effectiveness compared to supervised methods.
Findings
Some state-of-the-art TTA methods perform worse with realistic hp-selection.
Forgetting remains a challenge in TTA, with reset strategies being more robust.
Supervised or pretraining-based hp-selection strategies are most effective.
Abstract
Test-Time Adaptation (TTA) has recently emerged as a promising strategy for tackling the problem of machine learning model robustness under distribution shifts by adapting the model during inference without access to any labels. Because of task difficulty, hyperparameters strongly influence the effectiveness of adaptation. However, the literature has provided little exploration into optimal hyperparameter selection. In this work, we tackle this problem by evaluating existing TTA methods using surrogate-based hp-selection strategies (which do not assume access to the test labels) to obtain a more realistic evaluation of their performance. We show that some of the recent state-of-the-art methods exhibit inferior performance compared to the previous algorithms when using our more realistic evaluation setup. Further, we show that forgetting is still a problem in TTA as the only method that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Image Enhancement Techniques
