HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
Xiang Zhang, Yulun Zhang, Fisher Yu

TL;DR
This paper introduces HiT-SR, a hierarchical transformer architecture for image super-resolution that enhances multi-scale feature extraction and long-range dependencies while maintaining computational efficiency, leading to state-of-the-art results.
Contribution
The paper proposes a hierarchical transformer design with expanding windows and a linear complexity spatial-channel correlation method for efficient, high-performance image super-resolution.
Findings
Achieves state-of-the-art SR performance with fewer parameters.
Faster processing speed, approximately 7 times quicker.
Effective multi-scale feature aggregation and long-range dependency modeling.
Abstract
Transformers have exhibited promising performance in computer vision tasks including image super-resolution (SR). However, popular transformer-based SR methods often employ window self-attention with quadratic computational complexity to window sizes, resulting in fixed small windows with limited receptive fields. In this paper, we present a general strategy to convert transformer-based SR networks to hierarchical transformers (HiT-SR), boosting SR performance with multi-scale features while maintaining an efficient design. Specifically, we first replace the commonly used fixed small windows with expanding hierarchical windows to aggregate features at different scales and establish long-range dependencies. Considering the intensive computation required for large windows, we further design a spatial-channel correlation method with linear complexity to window sizes, efficiently gathering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗XiangZ/hit-srf-4x-df2kmodel· 198 dl· ♡ 2198 dl♡ 2
- 🤗XiangZ/hit-sir-2xmodel· 8 dl8 dl
- 🤗XiangZ/hit-sir-3xmodel· 1 dl1 dl
- 🤗XiangZ/hit-sir-4xmodel
- 🤗XiangZ/hit-sng-2xmodel· 2 dl2 dl
- 🤗XiangZ/hit-sng-3xmodel
- 🤗XiangZ/hit-sng-4xmodel· 1 dl1 dl
- 🤗XiangZ/hit-srf-2xmodel
- 🤗XiangZ/hit-srf-3xmodel
- 🤗XiangZ/hit-srf-4xmodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Image and Signal Denoising Methods
