CD-CTFM: A Lightweight CNN-Transformer Network for Remote Sensing Cloud Detection Fusing Multiscale Features
Wenxuan Ge, Xubing Yang, Li Zhang

TL;DR
The paper introduces CD-CTFM, a lightweight CNN-Transformer network that efficiently detects clouds in remote sensing images by fusing multiscale features, achieving high accuracy with fewer parameters and computations.
Contribution
A novel lightweight CNN-Transformer architecture with multiscale feature fusion and attention mechanisms for efficient cloud detection in satellite images.
Findings
Achieves comparable accuracy to state-of-the-art methods.
Outperforms existing methods in efficiency.
Effective multiscale feature fusion improves detection performance.
Abstract
Clouds in remote sensing images inevitably affect information extraction, which hinder the following analysis of satellite images. Hence, cloud detection is a necessary preprocessing procedure. However, the existing methods have numerous calculations and parameters. In this letter, a lightweight CNN-Transformer network, CD-CTFM, is proposed to solve the problem. CD-CTFM is based on encoder-decoder architecture and incorporates the attention mechanism. In the decoder part, we utilize a lightweight network combing CNN and Transformer as backbone, which is conducive to extract local and global features simultaneously. Moreover, a lightweight feature pyramid module is designed to fuse multiscale features with contextual information. In the decoder part, we integrate a lightweight channel-spatial attention module into each skip connection between encoder and decoder, extracting low-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Image Fusion Techniques · Remote Sensing in Agriculture
MethodsAttention Is All You Need · Dropout · Label Smoothing · Dense Connections · Linear Layer · Position-Wise Feed-Forward Layer · Layer Normalization · Multi-Head Attention · Absolute Position Encodings · Residual Connection
