ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
Carlos Hinojosa, Shuming Liu, Bernard Ghanem

TL;DR
ColorMAE introduces a data-independent masking strategy for Masked AutoEncoders that improves visual representation learning without extra computational costs, outperforming traditional random masking in downstream tasks.
Contribution
We propose ColorMAE, a novel data-independent masking method using color noise filtering, enhancing MAE performance without increasing model complexity.
Findings
Significant improvement in semantic segmentation accuracy (2.72 mIoU)
Outperforms random masking in downstream tasks
No additional computational overhead required
Abstract
Masked AutoEncoders (MAE) have emerged as a robust self-supervised framework, offering remarkable performance across a wide range of downstream tasks. To increase the difficulty of the pretext task and learn richer visual representations, existing works have focused on replacing standard random masking with more sophisticated strategies, such as adversarial-guided and teacher-guided masking. However, these strategies depend on the input data thus commonly increasing the model complexity and requiring additional calculations to generate the mask patterns. This raises the question: Can we enhance MAE performance beyond random masking without relying on input data or incurring additional computational costs? In this work, we introduce a simple yet effective data-independent method, termed ColorMAE, which generates different binary mask patterns by filtering random noise. Drawing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColor Science and Applications · Color perception and design
MethodsMasked autoencoder
