MatIR: A Hybrid Mamba-Transformer Image Restoration Model

Juan Wen; Weiyan Hou; Luc Van Gool; Radu Timofte

arXiv:2501.18401·cs.CV·February 10, 2025

MatIR: A Hybrid Mamba-Transformer Image Restoration Model

Juan Wen, Weiyan Hou, Luc Van Gool, Radu Timofte

PDF

Open Access 1 Repo

TL;DR

MatIR is a novel hybrid image restoration model combining Mamba and Transformer architectures to leverage their respective strengths, achieving improved contextual learning and efficiency in image restoration tasks.

Contribution

This paper introduces MatIR, a hybrid Mamba-Transformer model with cross-cycled blocks and novel modules, enhancing image restoration performance beyond existing models.

Findings

01

Outperforms existing models in image restoration tasks

02

Effectively captures long-range dependencies with IRSS module

03

Demonstrates superior efficiency and accuracy in experiments

Abstract

In recent years, Transformers-based models have made significant progress in the field of image restoration by leveraging their inherent ability to capture complex contextual features. Recently, Mamba models have made a splash in the field of computer vision due to their ability to handle long-range dependencies and their significant computational efficiency compared to Transformers. However, Mamba currently lags behind Transformers in contextual learning capabilities. To overcome the limitations of these two models, we propose a Mamba-Transformer hybrid image restoration model called MatIR. Specifically, MatIR cross-cycles the blocks of the Transformer layer and the Mamba layer to extract features, thereby taking full advantage of the advantages of the two architectures. In the Mamba module, we introduce the Image Inpainting State Space (IRSS) module, which traverses along four scan…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wenjuan7275/matir
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Image and Signal Denoising Methods

MethodsAttention Is All You Need · Simple Piecewise Linear and Adaptive with Symmetric Hinges · Linear Layer · Dense Connections · Multi-Head Attention · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Softmax · Mamba: Linear-Time Sequence Modeling with Selective State Spaces