Test-Time Domain Adaptation by Learning Domain-Aware Batch Normalization
Yanan Wu, Zhixiang Chi, Yang Wang, Konstantinos N. Plataniotis, Songhe, Feng

TL;DR
This paper introduces a test-time domain adaptation method that manipulates only the affine parameters of batch normalization layers, combined with self-supervised learning and meta-learning, to improve adaptation to unseen domains without increasing inference cost.
Contribution
The work proposes a novel approach focusing on updating only BN affine parameters and using an auxiliary SSL branch with meta-learning for effective domain adaptation.
Findings
Outperforms prior methods on five WILDS datasets.
Maintains same inference cost by discarding auxiliary branch after adaptation.
Enhances domain knowledge extraction through self-supervised learning.
Abstract
Test-time domain adaptation aims to adapt the model trained on source domains to unseen target domains using a few unlabeled images. Emerging research has shown that the label and domain information is separately embedded in the weight matrix and batch normalization (BN) layer. Previous works normally update the whole network naively without explicitly decoupling the knowledge between label and domain. As a result, it leads to knowledge interference and defective distribution adaptation. In this work, we propose to reduce such learning interference and elevate the domain knowledge learning by only manipulating the BN layer. However, the normalization step in BN is intrinsically unstable when the statistics are re-estimated from a few samples. We find that ambiguities can be greatly reduced when only updating the two affine parameters in BN while keeping the source domain statistics. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Multimodal Machine Learning Applications
MethodsBatch Normalization
