WaterMamba: Visual State Space Model for Underwater Image Enhancement
Meisheng Guan, Haiyong Xu, Gangyi Jiang, Mei Yu, Yeyao Chen, Ting Luo,, Yang Song

TL;DR
WaterMamba introduces a novel state space model for underwater image enhancement that efficiently models pixel and channel dependencies, outperforming existing methods with fewer resources.
Contribution
It proposes a new linear-complexity state space model with SCOSS blocks for underwater image enhancement, addressing limitations of CNN and Transformer approaches.
Findings
Outperforms state-of-the-art methods on multiple datasets
Reduces parameters and computational resources
Demonstrates strong generalizability
Abstract
Underwater imaging often suffers from low quality due to factors affecting light propagation and absorption in water. To improve image quality, some underwater image enhancement (UIE) methods based on convolutional neural networks (CNN) and Transformer have been proposed. However, CNN-based UIE methods are limited in modeling long-range dependencies, and Transformer-based methods involve a large number of parameters and complex self-attention mechanisms, posing efficiency challenges. Considering computational complexity and severe underwater image degradation, a state space model (SSM) with linear computational complexity for UIE, named WaterMamba, is proposed. We propose spatial-channel omnidirectional selective scan (SCOSS) blocks comprising spatial-channel coordinate omnidirectional selective scan (SCCOSS) modules and a multi-scale feedforward network (MSFFN). The SCOSS block models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
MethodsLinear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Softmax
