MatIR: A Hybrid Mamba-Transformer Image Restoration Model
Juan Wen, Weiyan Hou, Luc Van Gool, Radu Timofte

TL;DR
MatIR is a novel hybrid image restoration model combining Mamba and Transformer architectures to leverage their respective strengths, achieving improved contextual learning and efficiency in image restoration tasks.
Contribution
This paper introduces MatIR, a hybrid Mamba-Transformer model with cross-cycled blocks and novel modules, enhancing image restoration performance beyond existing models.
Findings
Outperforms existing models in image restoration tasks
Effectively captures long-range dependencies with IRSS module
Demonstrates superior efficiency and accuracy in experiments
Abstract
In recent years, Transformers-based models have made significant progress in the field of image restoration by leveraging their inherent ability to capture complex contextual features. Recently, Mamba models have made a splash in the field of computer vision due to their ability to handle long-range dependencies and their significant computational efficiency compared to Transformers. However, Mamba currently lags behind Transformers in contextual learning capabilities. To overcome the limitations of these two models, we propose a Mamba-Transformer hybrid image restoration model called MatIR. Specifically, MatIR cross-cycles the blocks of the Transformer layer and the Mamba layer to extract features, thereby taking full advantage of the advantages of the two architectures. In the Mamba module, we introduce the Image Inpainting State Space (IRSS) module, which traverses along four scan…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Image and Signal Denoising Methods
MethodsAttention Is All You Need · Simple Piecewise Linear and Adaptive with Symmetric Hinges · Linear Layer · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
