A Hybrid Transformer-Mamba Network for Single Image Deraining
Shangquan Sun, Wenqi Ren, Juxiang Zhou, Jianhou Gan, Rui Wang,, Xiaochun Cao

TL;DR
This paper introduces TransMamba, a hybrid Transformer-Mamba network that effectively captures long-range dependencies and spectral information for single image deraining, outperforming existing methods.
Contribution
The paper proposes a dual-branch hybrid network with spectral-banded Transformer and Mamba layers, incorporating spectral coherence loss for improved deraining performance.
Findings
Outperforms state-of-the-art deraining methods on multiple datasets.
Effectively captures long-range and spectral dependencies in images.
Demonstrates superior reconstruction of clean images with preserved details.
Abstract
Existing deraining Transformers employ self-attention mechanisms with fixed-range windows or along channel dimensions, limiting the exploitation of non-local receptive fields. In response to this issue, we introduce a novel dual-branch hybrid Transformer-Mamba network, denoted as TransMamba, aimed at effectively capturing long-range rain-related dependencies. Based on the prior of distinct spectral-domain features of rain degradation and background, we design a spectral-banded Transformer blocks on the first branch. Self-attention is executed within the combination of the spectral-domain channel dimension to improve the ability of modeling long-range dependencies. To enhance frequency-specific information, we present a spectral enhanced feed-forward module that aggregates features in the spectral domain. In the second branch, Mamba layers are equipped with cascaded bidirectional state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Medical Image Segmentation Techniques
MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Label Smoothing · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Residual Connection
