Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui, Chen, Yandong Guo, and Shanghang Zhang

TL;DR
Continual-MAE introduces an adaptive self-supervised autoencoder approach for continual test-time adaptation, effectively handling distribution shifts and improving performance in classification and segmentation tasks.
Contribution
It proposes a novel Distribution-aware Masking mechanism and a reconstruction strategy using invariant feature descriptors to enhance knowledge extraction during continual adaptation.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Effectively mitigates error accumulation and catastrophic forgetting.
Improves robustness to distribution shifts in real-world scenarios.
Abstract
Continual Test-Time Adaptation (CTTA) is proposed to migrate a source pre-trained model to continually changing target distributions, addressing real-world dynamism. Existing CTTA methods mainly rely on entropy minimization or teacher-student pseudo-labeling schemes for knowledge extraction in unlabeled target domains. However, dynamic data distributions cause miscalibrated predictions and noisy pseudo-labels in existing self-supervised learning methods, hindering the effective mitigation of error accumulation and catastrophic forgetting problems during the continual adaptation process. To tackle these issues, we propose a continual self-supervised method, Adaptive Distribution Masked Autoencoders (ADMA), which enhances the extraction of target domain knowledge while mitigating the accumulation of distribution shifts. Specifically, we propose a Distribution-aware Masking (DaM) mechanism…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
