MaxSR: Image Super-Resolution Using Improved MaxViT
Bincheng Yang, Gangshan Wu

TL;DR
MaxSR leverages an improved MaxViT transformer architecture to enhance single image super-resolution, effectively capturing self-similarity and hierarchical features, leading to state-of-the-art results.
Contribution
The paper introduces adaptive attention mechanisms within MaxViT blocks for super-resolution, improving global self-similarity modeling and achieving superior performance.
Findings
MaxSR achieves new state-of-the-art results in super-resolution.
MaxSR-light provides efficient super-resolution with competitive performance.
Improved attention mechanisms enhance global feature modeling.
Abstract
While transformer models have been demonstrated to be effective for natural language processing tasks and high-level vision tasks, only a few attempts have been made to use powerful transformer models for single image super-resolution. Because transformer models have powerful representation capacity and the in-built self-attention mechanisms in transformer models help to leverage self-similarity prior in input low-resolution image to improve performance for single image super-resolution, we present a single image super-resolution model based on recent hybrid vision transformer of MaxViT, named as MaxSR. MaxSR consists of four parts, a shallow feature extraction block, multiple cascaded adaptive MaxViT blocks to extract deep hierarchical features and model global self-similarity from low-level features efficiently, a hierarchical feature fusion block, and finally a reconstruction block.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Image Fusion Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Linear Layer · Layer Normalization · Residual Connection · Dense Connections · Vision Transformer · Hierarchical Feature Fusion
