Universal Test-time Adaptation through Weight Ensembling, Diversity Weighting, and Prior Correction
Robert A. Marsden, Mario D\"obler, Bin Yang

TL;DR
This paper introduces ROID, a comprehensive method for universal test-time adaptation that employs weight ensembling, diversity weighting, and prior correction to handle various distribution shifts and maintain model performance.
Contribution
It is the first work to address universal TTA across diverse environmental conditions using a broad spectrum of settings and novel weighting and correction techniques.
Findings
Sets new standards in universal TTA performance.
Effectively handles multiple domain shifts and class prior shifts.
Improves robustness and generalization in test-time adaptation.
Abstract
Since distribution shifts are likely to occur during test-time and can drastically decrease the model's performance, online test-time adaptation (TTA) continues to update the model after deployment, leveraging the current test data. Clearly, a method proposed for online TTA has to perform well for all kinds of environmental conditions. By introducing the variable factors domain non-stationarity and temporal correlation, we first unfold all practically relevant settings and define the entity as universal TTA. We want to highlight that this is the first work that covers such a broad spectrum, which is indispensable for the use in practice. To tackle the problem of universal TTA, we identify and highlight several challenges a self-training based method has to deal with: 1) model bias and the occurrence of trivial solutions when performing entropy minimization on varying sequence lengths…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Universal Test-Time Adaptation Through Weight Ensembling, Diversity Weighting, and Prior Correction· youtube
Taxonomy
TopicsCancer-related molecular mechanisms research · Domain Adaptation and Few-Shot Learning
MethodsTest
