ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders

Carlos Hinojosa; Shuming Liu; Bernard Ghanem

arXiv:2407.13036·cs.CV·July 19, 2024

TL;DR

ColorMAE introduces a data-independent masking strategy for Masked AutoEncoders that improves visual representation learning without extra computational costs, outperforming traditional random masking in downstream tasks.

Contribution

We propose ColorMAE, a novel data-independent masking method using color noise filtering, enhancing MAE performance without increasing model complexity.

Findings

01. Improves semantic segmentation accuracy by 2.72 mIoU relative to the baseline MAE

02. Outperforms random masking in downstream tasks

03. Requires no additional computational overhead

Abstract

Masked AutoEncoders (MAE) have emerged as a robust self-supervised framework, offering remarkable performance across a wide range of downstream tasks. To increase the difficulty of the pretext task and learn richer visual representations, existing works have focused on replacing standard random masking with more sophisticated strategies, such as adversarial-guided and teacher-guided masking. However, these strategies depend on the input data, thus commonly increasing the model complexity and requiring additional calculations to generate the mask patterns. This raises the question: Can we enhance MAE performance beyond random masking without relying on input data or incurring additional computational costs? In this work, we introduce a simple yet effective data-independent method, termed ColorMAE, which generates different binary mask patterns by filtering random noise. Drawing…
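The idea of generating a binary mask by filtering random noise can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `filtered_noise_mask`, the Gaussian low-pass filter, the `cutoff` parameter, and the top-k thresholding step are all assumptions made for illustration. The only point it demonstrates is that the mask is computed from noise alone, without looking at any input image.

```python
import numpy as np

def filtered_noise_mask(grid=14, mask_ratio=0.75, cutoff=0.15, seed=0):
    """Return a (grid, grid) boolean mask; True marks a masked patch.

    Hypothetical helper: the filter and threshold are illustrative choices.
    """
    rng = np.random.default_rng(seed)
    noise = rng.standard_normal((grid, grid))  # white noise; no image is used

    # Assumed filter: a Gaussian low-pass response in the frequency domain.
    fy = np.fft.fftfreq(grid)[:, None]
    fx = np.fft.fftfreq(grid)[None, :]
    response = np.exp(-(fx**2 + fy**2) / (2 * cutoff**2))
    filtered = np.fft.ifft2(np.fft.fft2(noise) * response).real

    # Mask the patches with the largest filtered values so that the
    # number of masked patches matches the requested ratio exactly.
    total = grid * grid
    num_masked = int(round(mask_ratio * total))
    order = np.argsort(filtered, axis=None)
    mask = np.zeros(total, dtype=bool)
    mask[order[total - num_masked:]] = True
    return mask.reshape(grid, grid)
```

With the defaults, the mask covers 147 of the 196 patches in a 14×14 grid. In this sketch, a small `cutoff` produces spatially clustered masked regions, while a large `cutoff` leaves the noise nearly unfiltered and the result approaches ordinary random masking.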

Code & Models

Repositories

carlosh93/ColorMAE
PyTorch · Official

Models

🤗 carlosh93/colormae (model)

Taxonomy

Topics: Color Science and Applications · Color perception and design

Methods: Masked autoencoder