ControlCom: Controllable Image Composition using Diffusion Model
Bo Zhang, Yuxuan Duan, Jun Lan, Yan Hong, Huijia Zhu, Weiqiang Wang,, Li Niu

TL;DR
ControlCom introduces a unified diffusion-based approach for controllable image composition, enabling tasks like blending, harmonization, view synthesis, and generation with improved foreground fidelity and user control.
Contribution
It unifies four image composition tasks into one diffusion model and proposes a self-supervised training framework with a local enhancement module for better foreground details.
Findings
Outperforms existing methods in faithfulness and controllability
Demonstrates effectiveness on benchmark and real-world data
Enhances foreground fidelity in composite images
Abstract
Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images, considering their great potential in image generation. However, they suffer from lack of controllability on foreground attributes and poor preservation of foreground identity. To address these challenges, we propose a controllable image composition method that unifies four tasks in one diffusion model: image blending, image harmonization, view synthesis, and generative composition. Meanwhile, we design a self-supervised training framework coupled with a tailored pipeline of training data preparation. Moreover, we propose a local enhancement module to enhance the foreground details in the diffusion model, improving the foreground fidelity of composite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Visual Attention and Saliency Detection · Aesthetic Perception and Analysis
MethodsDiffusion
