DIFFNAT: Improving Diffusion Image Quality Using Natural Image   Statistics

Aniket Roy; Maiterya Suin; Anshul Shah; Ketul Shah; Jiang Liu; Rama; Chellappa

arXiv:2311.09753·cs.CV·November 17, 2023·1 cites

DIFFNAT: Improving Diffusion Image Quality Using Natural Image Statistics

Aniket Roy, Maiterya Suin, Anshul Shah, Ketul Shah, Jiang Liu, Rama, Chellappa

PDF

Open Access

TL;DR

DIFFNAT introduces a kurtosis concentration loss that leverages natural image statistics to enhance diffusion model-generated image quality without extra guidance, validated across multiple tasks.

Contribution

Proposes a novel KC loss based on natural image kurtosis properties, applicable to any diffusion model to improve image quality.

Findings

01

Improved perceptual quality across tasks

02

Enhanced FID and MUSIQ scores

03

Positive user evaluations

Abstract

Diffusion models have advanced generative AI significantly in terms of editing and creating naturalistic images. However, efficiently improving generated image quality is still of paramount interest. In this context, we propose a generic "naturalness" preserving loss function, viz., kurtosis concentration (KC) loss, which can be readily applied to any standard diffusion model pipeline to elevate the image quality. Our motivation stems from the projected kurtosis concentration property of natural images, which states that natural images have nearly constant kurtosis values across different band-pass versions of the image. To retain the "naturalness" of the generated images, we enforce reducing the gap between the highest and lowest kurtosis values across the band-pass versions (e.g., Discrete Wavelet Transform (DWT)) of images. Note that our approach does not require any additional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Generative Adversarial Networks and Image Synthesis

MethodsMUSIQ · Diffusion