Contrastive Heliophysical Image Pretraining for Solar Dynamics Observatory Records
Shiyu Shen, Zhe Gao, Taifeng Chai, Yang Huang, Bin Pan

TL;DR
SolarCHIP introduces contrastively pretrained visual backbones tailored for solar images, improving performance on cross-modal translation and flare classification while reducing data needs.
Contribution
It presents a novel multi-granularity contrastive pretraining framework specifically designed for multi-instrument solar imaging data.
Findings
Achieves state-of-the-art results on flare classification and cross-modal translation.
Enhances low-resource learning scenarios with limited labeled data.
Demonstrates the effectiveness of contrastive components at multiple spatial and modal levels.
Abstract
Deep learning has revolutionized solar image analysis, yet most approaches train task-specific encoders from scratch or rely on natural-image pretraining that ignores the unique characteristics of Solar Dynamics Observatory (SDO) data. We introduce SolarCHIP, a family of contrastively pretrained visual backbones tailored to multi-instrument SDO observations. SolarCHIP addresses three key challenges in solar imaging: multimodal sensing across AIA and HMI instruments, weak inter-class separability due to slow temporal evolution, and strong intra-class variability with sparse activity signals. Our pretraining framework employs a multi-granularity contrastive objective that jointly aligns (1) global class tokens across co-temporal AIA-HMI pairs to enhance temporal discrimination, (2) local patch tokens at fixed spatial indices to enforce position-consistent, modality-invariant features, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
