Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual   Test-Time Adaptation

Jiaming Liu; Ran Xu; Senqiao Yang; Renrui Zhang; Qizhe Zhang; Zehui; Chen; Yandong Guo; and Shanghang Zhang

arXiv:2312.12480·cs.CV·March 28, 2024·2 cites

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation

Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui, Chen, Yandong Guo, and Shanghang Zhang

PDF

Open Access

TL;DR

Continual-MAE introduces an adaptive self-supervised autoencoder approach for continual test-time adaptation, effectively handling distribution shifts and improving performance in classification and segmentation tasks.

Contribution

It proposes a novel Distribution-aware Masking mechanism and a reconstruction strategy using invariant feature descriptors to enhance knowledge extraction during continual adaptation.

Findings

01

Achieves state-of-the-art results on multiple benchmarks.

02

Effectively mitigates error accumulation and catastrophic forgetting.

03

Improves robustness to distribution shifts in real-world scenarios.

Abstract

Continual Test-Time Adaptation (CTTA) is proposed to migrate a source pre-trained model to continually changing target distributions, addressing real-world dynamism. Existing CTTA methods mainly rely on entropy minimization or teacher-student pseudo-labeling schemes for knowledge extraction in unlabeled target domains. However, dynamic data distributions cause miscalibrated predictions and noisy pseudo-labels in existing self-supervised learning methods, hindering the effective mitigation of error accumulation and catastrophic forgetting problems during the continual adaptation process. To tackle these issues, we propose a continual self-supervised method, Adaptive Distribution Masked Autoencoders (ADMA), which enhances the extraction of target domain knowledge while mitigating the accumulation of distribution shifts. Specifically, we propose a Distribution-aware Masking (DaM) mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition