Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation
Ming Zhao, Xin Zhang, Andr\'e Kaup

TL;DR
This paper introduces MLDet, a multitask learning framework for SAR ship detection that integrates detection, speckle suppression, and segmentation tasks, improving accuracy and robustness in noisy, complex environments.
Contribution
The paper presents a novel multitask learning approach with a Gaussian-mask based segmentation and a weighted rotated boxes fusion strategy for enhanced SAR ship detection.
Findings
MLDet outperforms existing methods on SSDD+ and HRSID datasets.
The angle classification loss improves detection accuracy by addressing aspect ratio and angular periodicity.
The dual-feature fusion attention mechanism enhances robustness against speckle noise.
Abstract
Detecting ships in synthetic aperture radar (SAR) images is challenging due to strong speckle noise, complex surroundings, and varying scales. This paper proposes MLDet, a multitask learning framework for SAR ship detection, consisting of object detection, speckle suppression, and target segmentation tasks. An angle classification loss with aspect ratio weighting is introduced to improve detection accuracy by addressing angular periodicity and object proportions. The speckle suppression task uses a dual-feature fusion attention mechanism to reduce noise and fuse shallow and denoising features, enhancing robustness. The target segmentation task, leveraging a rotated Gaussian-mask, aids the network in extracting target regions from cluttered backgrounds and improves detection efficiency with pixel-level predictions. The Gaussian-mask ensures ship centers have the highest probabilities,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need
