SPMamba-YOLO: An Underwater Object Detection Network Based on Multi-Scale Feature Enhancement and Global Context Modeling
Guanghao Liao, Zhen Liu, Liyuan Cao, Yonghui Yang, Qi Li

TL;DR
SPMamba-YOLO is a novel underwater object detection network that combines multi-scale feature enhancement and global context modeling to improve detection accuracy in challenging underwater environments.
Contribution
It introduces a new network architecture integrating SPPELAN, PSA, and Mamba modules for enhanced feature aggregation and global context understanding.
Findings
Outperforms the YOLOv8n baseline by more than 4.9% in mAP@0.5
Improves detection of small and dense underwater objects
Maintains efficient computational cost
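For context, detection accuracy of this kind is usually scored by matching each predicted box to a ground-truth box using intersection over union (IoU). The following is a minimal sketch, not the authors' evaluation code. It assumes axis-aligned boxes given as (x1, y1, x2, y2) and a match threshold of 0.5; the function names `iou` and `is_match` are hypothetical.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) boxes.
    return inter / union if union > 0 else 0.0


def is_match(pred, truth, threshold=0.5):
    """Assumed matching rule: a prediction counts as correct at IoU >= threshold."""
    return iou(pred, truth) >= threshold
```

Small objects are penalized heavily by this rule: a shift of a few pixels on a tiny box removes a large share of the overlap, which is one reason small and dense targets are hard to score well on.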
Abstract
Underwater object detection is a critical yet challenging research problem owing to severe light attenuation, color distortion, background clutter, and the small scale of underwater targets. To address these challenges, we propose SPMamba-YOLO, a novel underwater object detection network that integrates multi-scale feature enhancement with global context modeling. Specifically, a Spatial Pyramid Pooling Enhanced Layer Aggregation Network (SPPELAN) module is introduced to strengthen multi-scale feature aggregation and expand the receptive field, while a Pyramid Split Attention (PSA) mechanism enhances feature discrimination by emphasizing informative regions and suppressing background interference. In addition, a Mamba-based state space modeling module is incorporated to efficiently capture long-range dependencies and global contextual information, thereby improving detection robustness…
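The long-range modeling attributed to the Mamba module rests on a state space recurrence. The sketch below illustrates only that underlying idea, under strong simplifying assumptions: a scalar state, fixed parameters `a`, `b`, `c`, and no input-dependent (selective) parameterization. It is not the module used in SPMamba-YOLO, and the name `ssm_scan` is hypothetical.

```python
def ssm_scan(xs, a=0.9, b=0.1, c=1.0):
    """Run a scalar linear state space recurrence over a sequence.

    h_t = a * h_{t-1} + b * x_t   (state update)
    y_t = c * h_t                 (readout)
    """
    h = 0.0  # hidden state carried across the whole sequence
    ys = []
    for x in xs:
        h = a * h + b * x
        ys.append(c * h)
    return ys


# An impulse at the first position keeps influencing later outputs,
# shrinking by a factor of `a` at every step.
impulse = [1.0, 0.0, 0.0, 0.0]
outputs = ssm_scan(impulse, a=0.5, b=1.0, c=1.0)
```

Because the state is updated once per element, the cost grows linearly with sequence length, while every output can still depend on every earlier input.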
Taxonomy
Topics: Image Enhancement Techniques · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
