DRCT: Saving Image Super-resolution away from Information Bottleneck

Chih-Chung Hsu; Chia-Ming Lee; and Yi-Shiuan Chou

arXiv:2404.00722·cs.CV·November 26, 2024·1 cites

DRCT: Saving Image Super-resolution away from Information Bottleneck

Chih-Chung Hsu, Chia-Ming Lee, and Yi-Shiuan Chou

PDF

Open Access 1 Repo 1 Models

TL;DR

The paper introduces DRCT, a Transformer-based super-resolution model with dense-residual connections that mitigates information bottlenecks, leading to improved performance over state-of-the-art methods.

Contribution

It proposes a novel Dense-residual-connected Transformer (DRCT) architecture to preserve spatial information and enhance super-resolution results.

Findings

01

Outperforms state-of-the-art super-resolution methods on benchmark datasets.

02

Effectively mitigates information bottleneck in Transformer models.

03

Achieves competitive results in NTIRE-2024 Image Super-Resolution Challenge.

Abstract

In recent years, Vision Transformer-based approaches for low-level vision tasks have achieved widespread success. Unlike CNN-based models, Transformers are more adept at capturing long-range dependencies, enabling the reconstruction of images utilizing non-local information. In the domain of super-resolution, Swin-transformer-based models have become mainstream due to their capability of global spatial information modeling and their shifting-window attention mechanism that facilitates the interchange of information between different windows. Many researchers have enhanced model performance by expanding the receptive fields or designing meticulous networks, yielding commendable results. However, we observed that it is a general phenomenon for the feature map intensity to be abruptly suppressed to small values towards the network's end. This implies an information bottleneck and a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ming053l/drct
pytorchOfficial

Models

🤗
aaronespasa/drct-super-resolution
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dense Connections · Label Smoothing