When Small Guides Large: Cross-Model Co-Learning for Test-Time Adaptation
Chang'an Yi, Xiaohui Deng, Guohao Chen, Yan Zhou, Qinghua Lu, Shuaicheng Niu

TL;DR
This paper introduces COCA, a cross-model co-learning framework for test-time adaptation that leverages complementary knowledge between models of different sizes to improve domain adaptation performance.
Contribution
It proposes a novel cross-model co-learning approach for TTA, enabling models of varying sizes to mutually enhance each other's adaptation capabilities.
Findings
Small models can effectively guide larger models in TTA.
COCA significantly improves adaptation accuracy across various model architectures.
Cross-model co-learning boosts performance on domain shift benchmarks.
Abstract
Test-time Adaptation (TTA) adapts a given model to testing domain data with potential domain shifts through online unsupervised learning, yielding impressive performance. However, to date, existing TTA methods primarily focus on single-model adaptation. In this work, we investigate an intriguing question: how does cross-model knowledge influence the TTA process? Our findings reveal that, in TTA's unsupervised online setting, each model can provide complementary, confident knowledge to the others, even when there are substantial differences in model size. For instance, a smaller model like MobileViT (10.6M parameters) can effectively guide a larger model like ViT-Base (86.6M parameters). In light of this, we propose COCA, a Cross-Model Co-Learning framework for TTA, which mainly consists of two main strategies. 1) Co-adaptation adaptively integrates complementary knowledge from other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
