WaterMamba: Visual State Space Model for Underwater Image Enhancement

Meisheng Guan; Haiyong Xu; Gangyi Jiang; Mei Yu; Yeyao Chen; Ting Luo,; Yang Song

arXiv:2405.08419·cs.CV·May 15, 2024·6 cites

WaterMamba: Visual State Space Model for Underwater Image Enhancement

Meisheng Guan, Haiyong Xu, Gangyi Jiang, Mei Yu, Yeyao Chen, Ting Luo,, Yang Song

PDF

Open Access

TL;DR

WaterMamba introduces a novel state space model for underwater image enhancement that efficiently models pixel and channel dependencies, outperforming existing methods with fewer resources.

Contribution

It proposes a new linear-complexity state space model with SCOSS blocks for underwater image enhancement, addressing limitations of CNN and Transformer approaches.

Findings

01

Outperforms state-of-the-art methods on multiple datasets

02

Reduces parameters and computational resources

03

Demonstrates strong generalizability

Abstract

Underwater imaging often suffers from low quality due to factors affecting light propagation and absorption in water. To improve image quality, some underwater image enhancement (UIE) methods based on convolutional neural networks (CNN) and Transformer have been proposed. However, CNN-based UIE methods are limited in modeling long-range dependencies, and Transformer-based methods involve a large number of parameters and complex self-attention mechanisms, posing efficiency challenges. Considering computational complexity and severe underwater image degradation, a state space model (SSM) with linear computational complexity for UIE, named WaterMamba, is proposed. We propose spatial-channel omnidirectional selective scan (SCOSS) blocks comprising spatial-channel coordinate omnidirectional selective scan (SCCOSS) modules and a multi-scale feedforward network (MSFFN). The SCOSS block models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Enhancement Techniques · Video Surveillance and Tracking Methods · Advanced Neural Network Applications

MethodsLinear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding · Adam · Softmax