TL;DR
This paper introduces GeleNet, a transformer-based model for optical remote sensing image saliency detection, utilizing global-to-local feature extraction and attention modules to improve detection accuracy over CNN-based methods.
Contribution
The paper proposes a novel transformer-driven network with specialized attention modules and knowledge transfer for enhanced salient object detection in remote sensing images.
Findings
GeleNet outperforms state-of-the-art methods on three public datasets.
Transformer backbone captures global long-range dependencies effectively.
Attention modules improve local detail and orientation perception.
Abstract
Existing methods for Salient Object Detection in Optical Remote Sensing Images (ORSI-SOD) mainly adopt Convolutional Neural Networks (CNNs) as the backbone, such as VGG and ResNet. Since CNNs can only extract features within certain receptive fields, most ORSI-SOD methods generally follow the local-to-contextual paradigm. In this paper, we propose a novel Global Extraction Local Exploration Network (GeleNet) for ORSI-SOD following the global-to-local paradigm. Specifically, GeleNet first adopts a transformer backbone to generate four-level feature embeddings with global long-range dependencies. Then, GeleNet employs a Direction-aware Shuffle Weighted Spatial Attention Module (D-SWSAM) and its simplified version (SWSAM) to enhance local interactions, and a Knowledge Transfer Module (KTM) to further enhance cross-level contextual interactions. D-SWSAM comprehensively perceives the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsResidual Connection · Batch Normalization · Global Average Pooling · Kaiming Initialization · Softmax · Dense Connections · Max Pooling · Average Pooling · Convolution · Sigmoid Activation
