SDCoNet: Saliency-Driven Multi-Task Collaborative Network for Remote Sensing Object Detection
Ruo Qi, Linhui Dai, Yusong Qin, Chaolei Yang, Yanshan Li

TL;DR
SDCoNet is a multi-task network that improves remote sensing object detection by coupling saliency-driven super-resolution with collaborative feature sharing, raising accuracy on small objects in low-quality images.
Contribution
The paper introduces SDCoNet, which couples super-resolution (SR) and detection through implicit feature sharing, saliency-guided focus, and gradient routing. This addresses the misaligned optimization objectives and feature redundancy of serial SR-then-detection pipelines in remote sensing detection.
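As a rough illustration of the coupling described above, the toy sketch below runs one shared encoder into two task heads and combines a saliency-weighted SR loss with a detection loss. It is not the paper's implementation: the function names (`shared_encoder`, `sr_head`, `det_head`, `saliency_weighted_l1`, `joint_loss`), the 1-D "image", the loss forms, and the `sr_weight` parameter are all assumptions made for illustration, and gradient routing is not modeled.

```python
# Toy sketch of a shared encoder with two task heads and a joint loss.
# Every name and formula here is a hypothetical stand-in, not SDCoNet's design.

def shared_encoder(pixels):
    """Stand-in for a shared encoder: 3-tap local average over a 1-D signal."""
    n = len(pixels)
    feats = []
    for i in range(n):
        left = pixels[max(i - 1, 0)]
        right = pixels[min(i + 1, n - 1)]
        feats.append((left + pixels[i] + right) / 3.0)
    return feats

def sr_head(feats):
    """Hypothetical SR head: 2x upsampling by repeating each feature."""
    out = []
    for f in feats:
        out.extend([f, f])
    return out

def det_head(feats):
    """Hypothetical detection head: clamp features to [0, 1] as objectness scores."""
    return [min(max(f, 0.0), 1.0) for f in feats]

def saliency_weighted_l1(pred, target, saliency):
    """L1 reconstruction error where each position is weighted by its saliency."""
    total_weight = sum(saliency)
    if total_weight == 0:
        return 0.0
    weighted = sum(s * abs(p - t) for p, t, s in zip(pred, target, saliency))
    return weighted / total_weight

def mse(pred, target):
    """Mean squared error, used here as a placeholder detection loss."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)

def joint_loss(det_loss, sr_loss, sr_weight=0.5):
    """Assumed combination: detection loss plus a weighted SR loss."""
    return det_loss + sr_weight * sr_loss

if __name__ == "__main__":
    low_res = [0.0, 0.0, 0.9, 0.0]                         # one weak "object" pixel
    high_res = [0.0, 0.0, 0.0, 0.0, 0.9, 0.9, 0.0, 0.0]    # 2x target
    saliency = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0]    # focus on the object
    labels = [0.0, 0.0, 1.0, 0.0]

    feats = shared_encoder(low_res)                        # computed once, used by both heads
    sr_loss = saliency_weighted_l1(sr_head(feats), high_res, saliency)
    det_loss = mse(det_head(feats), labels)
    print("joint loss:", joint_loss(det_loss, sr_loss))
```

Because both heads read the same `feats`, any update driven by the joint loss would shape a single representation for both tasks, in contrast to a serial pipeline where the SR model is optimized separately from the detector. The saliency weights make reconstruction errors on background positions contribute nothing in this toy setup, which is one simple way to express a saliency-guided focus.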
Findings
Outperforms existing algorithms in small object detection accuracy.
Effectively suppresses background clutter and emphasizes weak object regions.
Maintains competitive computational efficiency.
Abstract
In remote sensing images, complex backgrounds, weak object signals, and small object scales make accurate detection particularly challenging, especially under low-quality imaging conditions. A common strategy is to apply single-image super-resolution (SR) before detection; however, such serial pipelines often suffer from misaligned optimization objectives, feature redundancy, and a lack of effective interaction between SR and detection. To address these issues, we propose a Saliency-Driven multi-task Collaborative Network (SDCoNet) that couples SR and detection through implicit feature sharing while preserving task specificity. SDCoNet employs a Swin Transformer-based shared encoder, where hierarchical window-shifted self-attention supports cross-task feature collaboration and adaptively balances the trade-off between texture refinement and semantic representation. In addition, a…
Taxonomy
Topics: Advanced Image Fusion Techniques · Visual Attention and Saliency Detection · Advanced Neural Network Applications
