Blending Concepts with Text-to-Image Diffusion Models
Lorenzo Olearo, Giorgio Longari, Alessandro Raganato, Rafael Pe\~naloza, Simone Melzi

TL;DR
This paper explores the ability of diffusion models to blend multiple concepts into new images without additional training, demonstrating their creative potential and sensitivity to input variations.
Contribution
It systematically evaluates four concept blending methods in diffusion models, revealing their strengths, limitations, and the factors influencing blending quality.
Findings
Diffusion models can blend concepts creatively without fine-tuning.
Different blending methods perform variably depending on scenarios.
Prompt order and input variations significantly affect results.
Abstract
Diffusion models have dramatically advanced text-to-image generation in recent years, translating abstract concepts into high-fidelity images with remarkable ease. In this work, we examine whether they can also blend distinct concepts, ranging from concrete objects to intangible ideas, into coherent new visual entities under a zero-shot framework. Specifically, concept blending merges the key attributes of multiple concepts (expressed as textual prompts) into a single, novel image that captures the essence of each concept. We investigate four blending methods, each exploiting different aspects of the diffusion pipeline (e.g., prompt scheduling, embedding interpolation, or layer-wise conditioning). Through systematic experimentation across diverse concept categories, such as merging concrete concepts, synthesizing compound words, transferring artistic styles, and blending architectural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Aesthetic Perception and Analysis
