HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution

Shu-Chuan Chu; Zhi-Chao Dou; Jeng-Shyang Pan; Shaowei Weng; Junbao Li

arXiv:2405.05001·cs.CV·May 9, 2024

HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution

Shu-Chuan Chu, Zhi-Chao Dou, Jeng-Shyang Pan, Shaowei Weng, Junbao Li

PDF

Open Access 1 Repo

TL;DR

HMANet is a novel hybrid transformer network that combines multi-axis aggregation with a pre-training method to significantly improve image super-resolution performance over existing methods.

Contribution

The paper introduces HMANet, a hybrid architecture combining Residual Hybrid Transformer Blocks and Grid Attention Blocks, with a new pre-training strategy for enhanced super-resolution.

Findings

01

HMANet outperforms state-of-the-art super-resolution methods on benchmark datasets.

02

The hybrid architecture effectively fuses local and non-local features.

03

Pre-training enhances the model's representation capabilities.

Abstract

Transformer-based methods have demonstrated excellent performance on super-resolution visual tasks, surpassing conventional convolutional neural networks. However, existing work typically restricts self-attention computation to non-overlapping windows to save computational costs. This means that Transformer-based networks can only use input information from a limited spatial range. Therefore, a novel Hybrid Multi-Axis Aggregation network (HMA) is proposed in this paper to exploit feature potential information better. HMA is constructed by stacking Residual Hybrid Transformer Blocks(RHTB) and Grid Attention Blocks(GAB). On the one side, RHTB combines channel attention and self-attention to enhance non-local feature fusion and produce more attractive visual results. Conversely, GAB is used in cross-domain information interaction to jointly model similar features and obtain a larger…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

korouuuuu/hma
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Advanced Image Fusion Techniques

MethodsAttention Is All You Need · Dropout · Label Smoothing · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Adam