Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation
Aiwen Jiang, Hourong Chen, Zhiwen Chen, Jihua Ye, Mingwen Wang

TL;DR
This paper introduces a novel image restoration model that combines Mamba and Transformer mechanisms to efficiently capture spatial and channel dependencies, achieving state-of-the-art results across multiple restoration tasks.
Contribution
It proposes a multi-dimensional prompt learning framework that leverages Mamba's selective spatial scanning and Transformer’s channel attention for improved efficiency and effectiveness.
Findings
Achieves state-of-the-art performance on denoising, dehazing, and deraining benchmarks.
Utilizes linear complexity spatial modeling with Mamba for long-range dependencies.
Employs multi-scale prompt learning to enhance restoration capabilities.
Abstract
Recent efforts on image restoration have focused on developing "all-in-one" models that can handle different degradation types and levels within single model. However, most of mainstream Transformer-based ones confronted with dilemma between model capabilities and computation burdens, since self-attention mechanism quadratically increase in computational complexity with respect to image size, and has inadequacies in capturing long-range dependencies. Most of Mamba-related ones solely scanned feature map in spatial dimension for global modeling, failing to fully utilize information in channel dimension. To address aforementioned problems, this paper has proposed to fully utilize complementary advantages from Mamba and Transformer without sacrificing computation efficiency. Specifically, the selective scanning mechanism of Mamba is employed to focus on spatial modeling, enabling capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Image and Signal Denoising Methods
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Residual Connection · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
