DRCT: Saving Image Super-resolution away from Information Bottleneck
Chih-Chung Hsu, Chia-Ming Lee, and Yi-Shiuan Chou

TL;DR
The paper introduces DRCT, a Transformer-based super-resolution model with dense-residual connections that mitigates information bottlenecks, leading to improved performance over state-of-the-art methods.
Contribution
It proposes a novel Dense-residual-connected Transformer (DRCT) architecture to preserve spatial information and enhance super-resolution results.
Findings
Outperforms state-of-the-art super-resolution methods on benchmark datasets.
Effectively mitigates information bottleneck in Transformer models.
Achieves competitive results in NTIRE-2024 Image Super-Resolution Challenge.
Abstract
In recent years, Vision Transformer-based approaches for low-level vision tasks have achieved widespread success. Unlike CNN-based models, Transformers are more adept at capturing long-range dependencies, enabling the reconstruction of images utilizing non-local information. In the domain of super-resolution, Swin-transformer-based models have become mainstream due to their capability of global spatial information modeling and their shifting-window attention mechanism that facilitates the interchange of information between different windows. Many researchers have enhanced model performance by expanding the receptive fields or designing meticulous networks, yielding commendable results. However, we observed that it is a general phenomenon for the feature map intensity to be abruptly suppressed to small values towards the network's end. This implies an information bottleneck and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dense Connections · Label Smoothing
