A Hybrid Transformer-Mamba Network for Single Image Deraining

Shangquan Sun; Wenqi Ren; Juxiang Zhou; Jianhou Gan; Rui Wang,; Xiaochun Cao

arXiv:2409.00410·cs.CV·September 4, 2024·3 cites

A Hybrid Transformer-Mamba Network for Single Image Deraining

Shangquan Sun, Wenqi Ren, Juxiang Zhou, Jianhou Gan, Rui Wang,, Xiaochun Cao

PDF

Open Access 1 Repo

TL;DR

This paper introduces TransMamba, a hybrid Transformer-Mamba network that effectively captures long-range dependencies and spectral information for single image deraining, outperforming existing methods.

Contribution

The paper proposes a dual-branch hybrid network with spectral-banded Transformer and Mamba layers, incorporating spectral coherence loss for improved deraining performance.

Findings

01

Outperforms state-of-the-art deraining methods on multiple datasets.

02

Effectively captures long-range and spectral dependencies in images.

03

Demonstrates superior reconstruction of clean images with preserved details.

Abstract

Existing deraining Transformers employ self-attention mechanisms with fixed-range windows or along channel dimensions, limiting the exploitation of non-local receptive fields. In response to this issue, we introduce a novel dual-branch hybrid Transformer-Mamba network, denoted as TransMamba, aimed at effectively capturing long-range rain-related dependencies. Based on the prior of distinct spectral-domain features of rain degradation and background, we design a spectral-banded Transformer blocks on the first branch. Self-attention is executed within the combination of the spectral-domain channel dimension to improve the ability of modeling long-range dependencies. To enhance frequency-specific information, we present a spectral enhanced feed-forward module that aggregates features in the spectral domain. In the second branch, Mamba layers are equipped with cascaded bidirectional state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sunshangquan/transmamba
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Medical Image Segmentation Techniques

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Label Smoothing · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Residual Connection