AEMIM: Adversarial Examples Meet Masked Image Modeling
Wenzhao Xiang, Chang Liu, Hang Su, Hongyang Yu

TL;DR
This paper introduces AEMIM, a novel approach that integrates adversarial examples into masked image modeling to improve representation learning, robustness, and efficiency across various vision tasks.
Contribution
It presents a new adversarial-based pretext task and attack method for masked image modeling, enhancing generalization and robustness without restricting model architecture.
Findings
Outperforms baseline methods on ImageNet and variants
Improves robustness against adversarial attacks
Enhances generalization to downstream tasks
Abstract
Masked image modeling (MIM) has gained significant traction for its remarkable prowess in representation learning. As an alternative to the traditional approach, the reconstruction from corrupted images has recently emerged as a promising pretext task. However, the regular corrupted images are generated using generic generators, often lacking relevance to the specific reconstruction task involved in pre-training. Hence, reconstruction from regular corrupted images cannot ensure the difficulty of the pretext task, potentially leading to a performance decline. Moreover, generating corrupted images might introduce an extra generator, resulting in a notable computational burden. To address these issues, we propose to incorporate adversarial examples into masked image modeling, as the new reconstruction targets. Adversarial examples, generated online using only the trained models, can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Computer Graphics and Visualization Techniques
MethodsMutual Information Machine/Mask Image Modeling
