Latent Guidance in Diffusion Models for Perceptual Evaluations
Shreshth Saini, Ru-Ling Liao, Yan Ye, Alan C. Bovik

TL;DR
This paper introduces Perceptual Manifold Guidance (PMG), a novel method guiding latent diffusion models with perceptual features to improve No-Reference Image Quality Assessment, achieving state-of-the-art results.
Contribution
It is the first to guide diffusion models with perceptual features for NR-IQA, enhancing perceptual consistency and performance.
Findings
High correlation of hyperfeatures with human perception.
State-of-the-art performance on IQA datasets.
Easy integration with existing pretrained models.
Abstract
Despite recent advancements in latent diffusion models that generate high-dimensional image data and perform various downstream tasks, there has been little exploration into perceptual consistency within these models on the task of No-Reference Image Quality Assessment (NR-IQA). In this paper, we hypothesize that latent diffusion models implicitly exhibit perceptually consistent local regions within the data manifold. We leverage this insight to guide on-manifold sampling using perceptual features and input measurements. Specifically, we propose Perceptual Manifold Guidance (PMG), an algorithm that utilizes pretrained latent diffusion models and perceptual quality features to obtain perceptually consistent multi-scale and multi-timestep feature maps from the denoising U-Net. We empirically demonstrate that these hyperfeatures exhibit high correlation with human perception in IQA tasks.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Neural Networks and Applications · Computer Graphics and Visualization Techniques
