DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang, Xi Chen, Xiaogang Xu, Sihui Ji, Yu Liu, Yujun Shen,, Hengshuang Zhao

TL;DR
DiffDoctor introduces a two-stage approach with a robust artifact detector and pixel-level feedback to improve image diffusion models by reducing artifacts and enhancing image quality.
Contribution
The paper presents a novel two-stage pipeline with a large dataset and human-in-the-loop annotation for artifact detection and model optimization.
Findings
Effective artifact detection on over 1M images
Improved diffusion model quality with pixel-level feedback
Robustness of the proposed artifact detector
Abstract
In spite of recent progress, image diffusion models still produce artifacts. A common solution is to leverage the feedback provided by quality assessment systems or human annotators to optimize the model, where images are generally rated in their entirety. In this work, we believe problem-solving starts with identification, yielding the request that the model should be aware of not just the presence of defects in an image, but their specific locations. Motivated by this, we propose DiffDoctor, a two-stage pipeline to assist image diffusion models in generating fewer artifacts. Concretely, the first stage targets developing a robust artifact detector, for which we collect a dataset of over 1M flawed synthesized images and set up an efficient human-in-the-loop annotation process, incorporating a carefully designed class-balance strategy. The learned artifact detector is then involved in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Multimodal Machine Learning Applications
MethodsAttentive Walk-Aggregating Graph Neural Network · Diffusion · Sparse Evolutionary Training
