Cross-Modal Spherical Aggregation for Weakly Supervised Remote Sensing Shadow Removal
Kaichen Chi, Wei Jing, Junjie Li, Qiang Li, Qi Wang

TL;DR
This paper introduces S2-ShadowNet, a weakly supervised method using spherical feature space and cross-modal learning to improve remote sensing shadow removal by leveraging visible and infrared images.
Contribution
It proposes a novel spherical feature space and cross-domain translation approach for weakly supervised shadow removal in remote sensing imagery.
Findings
Effective shadow removal demonstrated on a large-scale benchmark.
Improved feature separation and modality alignment achieved.
No need for strict paired training data.
Abstract
Remote sensing shadow removal, which aims to recover contaminated surface information, is tricky since shadows typically display overwhelmingly low illumination intensities. In contrast, the infrared image is robust toward significant light changes, providing visual clues complementary to the visible image. Nevertheless, the existing methods ignore the collaboration between heterogeneous modalities, leading to undesired quality degradation. To fill this gap, we propose a weakly supervised shadow removal network with a spherical feature space, dubbed S2-ShadowNet, to explore the best of both worlds for visible and infrared modalities. Specifically, we employ a modal translation (visible-to-infrared) model to learn the cross-domain mapping, thus generating realistic infrared samples. Then, Swin Transformer is utilized to extract strong representational visible/infrared features.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Image Fusion Techniques · Image and Signal Denoising Methods
MethodsStochastic Depth · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Swin Transformer · Position-Wise Feed-Forward Layer · Dropout · Adam · Attention Is All You Need
