Fast constrained sampling in pre-trained diffusion models
Alexandros Graikos, Nebojsa Jojic, Dimitris Samaras

TL;DR
This paper introduces a fast, efficient algorithm for constrained image generation using pre-trained diffusion models, enabling high-quality results under various constraints without extensive retraining.
Contribution
The authors propose an approximation to Newton's method that accelerates constrained sampling in diffusion models, avoiding backpropagation and improving speed and reliability.
Findings
Achieves faster inference than existing methods
Produces high-quality images under diverse constraints
Surpasses or matches state-of-the-art training-free approaches
Abstract
Large denoising diffusion models, such as Stable Diffusion, have been trained on billions of image-caption pairs to perform text-conditioned image generation. As a byproduct of this training, these models have acquired general knowledge about image statistics, which can be useful for other inference tasks. However, when confronted with sampling an image under new constraints, e.g. generating the missing parts of an image, using large pre-trained text-to-image diffusion models is inefficient and often unreliable. Previous approaches either utilized backpropagation through the denoiser network, making them significantly slower and more memory-demanding than simple text-to-image generation, or only enforced the constraint locally, failing to capture critical long-range correlations in the sampled image. In this work, we propose an algorithm that enables fast, high-quality generation under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · NMR spectroscopy and applications · Advanced Mathematical Modeling in Engineering
MethodsDiffusion
