ATD: Improved Transformer with Adaptive Token Dictionary for Image Restoration
Leheng Zhang, Wei Long, Yawei Li, Xingyu Zhou, Xiaorui Zhao, Shuhang Gu

TL;DR
The paper introduces ATD, a transformer architecture with a learnable token dictionary and cross-attention mechanism, enabling efficient global dependency modeling for image restoration tasks with improved performance and reduced complexity.
Contribution
It proposes a novel adaptive token dictionary and cross-attention mechanism for transformers, enhancing global modeling and efficiency in image restoration.
Findings
Achieves state-of-the-art results on super-resolution benchmarks.
Demonstrates effectiveness across denoising and compression artifact removal.
Lightweight ATD-light performs competitively with lower computational cost.
Abstract
Recently, Transformers have gained significant popularity in image restoration tasks such as image super-resolution and denoising, owing to their superior performance. However, balancing performance and computational burden remains a long-standing problem for transformer-based architectures. Due to the quadratic complexity of self-attention, existing methods often restrict attention to local windows, resulting in limited receptive field and suboptimal performance. To address this issue, we propose Adaptive Token Dictionary (ATD), a novel transformer-based architecture for image restoration that enables global dependency modeling with linear complexity relative to image size. The ATD model incorporates a learnable token dictionary, which summarizes external image priors (i.e., typical image structures) during the training process. To utilize this information, we introduce a token…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Image Fusion Techniques · Image and Signal Denoising Methods
