InfMAE: A Foundation Model in the Infrared Modality
Fangcen Liu, Chenqiang Gao, Yaming Zhang, Junjie Guo, Jinhao Wang,, Deyu Meng

TL;DR
InfMAE introduces a novel infrared foundation model utilizing a new dataset, an information-aware masking strategy, and specialized encoder-decoder architecture, significantly advancing infrared vision tasks with superior performance.
Contribution
The paper presents the first infrared foundation model, InfMAE, with a new large-scale dataset, tailored masking strategy, multi-scale encoder, and infrared-specific decoder, addressing unique challenges in infrared vision.
Findings
InfMAE outperforms existing methods in downstream tasks.
The information-aware masking enhances representation learning.
The multi-scale encoder and infrared decoder improve task performance.
Abstract
In recent years, the foundation models have swept the computer vision field and facilitated the development of various tasks within different modalities. However, it remains an open question on how to design an infrared foundation model. In this paper, we propose InfMAE, a foundation model in infrared modality. We release an infrared dataset, called Inf30 to address the problem of lacking large-scale data for self-supervised learning in the infrared vision community. Besides, we design an information-aware masking strategy, which is suitable for infrared images. This masking strategy allows for a greater emphasis on the regions with richer information in infrared images during the self-supervised learning process, which is conducive to learning the generalized representation. In addition, we adopt a multi-scale encoder to enhance the performance of the pre-trained encoders in downstream…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAstronomical Observations and Instrumentation · Infrared Target Detection Methodologies
