DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Yuang Ai, Xiaoqiang Zhou, Huaibo Huang, Xiaotian Han, Zhengyu Chen,, Quanzeng You, Hongxia Yang

TL;DR
DreamClear introduces a large-scale, privacy-safe dataset creation pipeline and a novel diffusion transformer-based model with adaptive modulation, significantly advancing real-world image restoration capabilities.
Contribution
The paper presents GenIR, a dual-prompt data curation pipeline for large-scale datasets, and DreamClear, a diffusion transformer model with adaptive modulation for improved real-world IR.
Findings
DreamClear outperforms existing IR models in real-world scenarios.
GenIR successfully creates a one-million-image dataset with privacy safety.
The MoAM module enhances the model's ability to handle diverse degradations.
Abstract
Image restoration (IR) in real-world scenarios presents significant challenges due to the lack of high-capacity models and comprehensive datasets. To tackle these issues, we present a dual strategy: GenIR, an innovative data curation pipeline, and DreamClear, a cutting-edge Diffusion Transformer (DiT)-based image restoration model. GenIR, our pioneering contribution, is a dual-prompt learning pipeline that overcomes the limitations of existing datasets, which typically comprise only a few thousand images and thus offer limited generalizability for larger models. GenIR streamlines the process into three stages: image-text pair construction, dual-prompt based fine-tuning, and data generation & filtering. This approach circumvents the laborious data crawling process, ensuring copyright compliance and providing a cost-effective, privacy-safe solution for IR dataset construction. The result…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
MethodsLinear Layer · Dense Connections · Multi-Head Attention · Adam · Softmax · Dropout · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding · Layer Normalization
