On Conditioning the Input Noise for Controlled Image Generation with   Diffusion Models

Vedant Singh; Surgan Jandial; Ayush Chopra; Siddharth Ramesh; Balaji; Krishnamurthy; Vineeth N. Balasubramanian

arXiv:2205.03859·cs.CV·May 10, 2022·5 cites

On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models

Vedant Singh, Surgan Jandial, Ayush Chopra, Siddharth Ramesh, Balaji, Krishnamurthy, Vineeth N. Balasubramanian

PDF

Open Access

TL;DR

This paper introduces a novel method for controlling diffusion-based image generation by conditioning the input noise artifacts, enabling semantic attribute-based image synthesis without modifying the diffusion process itself.

Contribution

It proposes a new technique to condition diffusion models through input noise artifacts, offering more control over generated images compared to existing methods.

Findings

01

Effective conditioning on semantic attributes demonstrated

02

Improved control over image generation quality

03

Potential for broader applications in image editing

Abstract

Conditional image generation has paved the way for several breakthroughs in image editing, generating stock photos and 3-D object generation. This continues to be a significant area of interest with the rise of new state-of-the-art methods that are based on diffusion models. However, diffusion models provide very little control over the generated image, which led to subsequent works exploring techniques like classifier guidance, that provides a way to trade off diversity with fidelity. In this work, we explore techniques to condition diffusion models with carefully crafted input noise artifacts. This allows generation of images conditioned on semantic attributes. This is different from existing approaches that input Gaussian noise and further introduce conditioning at the diffusion model's inference step. Our experiments over several examples and conditional settings show the potential…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Retrieval and Classification Techniques · Music and Audio Processing

MethodsDiffusion