BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation
Daeun Lee, Jaehong Yoon, Sung Ju Hwang

TL;DR
BECoTTA introduces an input-dependent, efficient modular framework for continual test-time adaptation that improves domain adaptation performance while significantly reducing trainable parameters.
Contribution
It proposes MoDE, a novel mixture-of-experts approach with domain-adaptive routing and synergy loss for better continual test-time adaptation.
Findings
Outperforms existing CTTA methods in disjoint and gradual domain shifts.
Requires approximately 98% fewer trainable parameters.
Provides detailed analyses and visualizations of the method's components.
Abstract
Continual Test Time Adaptation (CTTA) is required to adapt efficiently to continuous unseen domains while retaining previously learned knowledge. However, despite the progress of CTTA, it is still challenging to deploy the model with improved forgetting-adaptation trade-offs and efficiency. In addition, current CTTA scenarios assume only the disjoint situation, even though real-world domains are seamlessly changed. To address these challenges, this paper proposes BECoTTA, an input-dependent and efficient modular framework for CTTA. We propose Mixture-of Domain Low-rank Experts (MoDE) that contains two core components: (i) Domain-Adaptive Routing, which helps to selectively capture the domain adaptive knowledge with multiple domain routers, and (ii) Domain-Expert Synergy Loss to maximize the dependency between each domain and expert. We validate that our method outperforms multiple CTTA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Seismology and Earthquake Studies · Anomaly Detection Techniques and Applications
