DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics
Aniket Roy, Maiterya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama, Chellappa

TL;DR
DIFFNAT introduces a kurtosis concentration loss that leverages natural image statistics to enhance diffusion model-generated image quality without extra guidance, validated across multiple tasks.
Contribution
Proposes a novel KC loss based on natural image kurtosis properties, applicable to any diffusion model to improve image quality.
Findings
Improved perceptual quality across tasks
Enhanced FID and MUSIQ scores
Positive user evaluations
Abstract
Diffusion models have advanced generative AI significantly in terms of editing and creating naturalistic images. However, efficiently improving generated image quality is still of paramount interest. In this context, we propose a generic "naturalness" preserving loss function, viz., kurtosis concentration (KC) loss, which can be readily applied to any standard diffusion model pipeline to elevate the image quality. Our motivation stems from the projected kurtosis concentration property of natural images, which states that natural images have nearly constant kurtosis values across different band-pass versions of the image. To retain the "naturalness" of the generated images, we enforce reducing the gap between the highest and lowest kurtosis values across the band-pass versions (e.g., Discrete Wavelet Transform (DWT)) of images. Note that our approach does not require any additional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis
MethodsMUSIQ · Diffusion
