Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Liyan Wang, Qinyu Yang, Cong Wang, Wei Wang, Jinshan Pan, Zhixun Su

TL;DR
This paper introduces a novel coarse-to-fine diffusion Transformer (C2F-DFT) for image restoration, combining diffusion self-attention and feed-forward networks within a new training scheme to improve restoration quality.
Contribution
The paper proposes a new coarse-to-fine training scheme with diffusion Transformer components, enhancing noise estimation and hierarchy diffusion representation for better image restoration.
Findings
Outperforms IR-SDE in diffusion-based restoration tasks.
Achieves competitive results with state-of-the-art Transformer methods.
Effective in deraining, deblurring, and denoising tasks.
Abstract
Recent years have witnessed the remarkable performance of diffusion models in various vision tasks. However, for image restoration that aims to recover clear images with sharper details from given degraded observations, diffusion-based methods may fail to recover promising results due to inaccurate noise estimation. Moreover, simple constraining noises cannot effectively learn complex degradation information, which subsequently hinders the model capacity. To solve the above problems, we propose a coarse-to-fine diffusion Transformer (C2F-DFT) for image restoration. Specifically, our C2F-DFT contains diffusion self-attention (DFSA) and diffusion feed-forward network (DFN) within a new coarse-to-fine training scheme. The DFSA and DFN respectively capture the long-range diffusion dependencies and learn hierarchy diffusion representation to facilitate better restoration. In the coarse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMRI in cancer diagnosis · Image and Signal Denoising Methods · Sparse and Compressive Sensing Techniques
MethodsMulti-Head Attention · Attention Is All You Need · fail · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Absolute Position Encodings
