Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation
Pu Cao, Feng Zhou, Lu Yang, Tianrui Huang, Qing Song

TL;DR
This paper introduces a guidance-decoupled prior preservation method that enables high-quality, controllable in-domain image generation using label-free data and pre-trained diffusion models, broadening their application scope.
Contribution
The authors propose a novel guidance-decoupled prior preservation technique and an efficient domain knowledge learning method to enhance in-domain diffusion model generation without labeled data.
Findings
Achieves superior in-domain image synthesis quality.
Maintains open-world control and unconditional generation capabilities.
Compatible with various diffusion-based control methods.
Abstract
In-domain generation aims to perform a variety of tasks within a specific domain, such as unconditional generation, text-to-image, image editing, 3D generation, and more. Early research typically required training specialized generators for each unique task and domain, often relying on fully-labeled data. Motivated by the powerful generative capabilities and broad applications of diffusion models, we are driven to explore leveraging label-free data to empower these models for in-domain generation. Fine-tuning a pre-trained generative model on domain data is an intuitive but challenging way and often requires complex manual hyper-parameter adjustments since the limited diversity of the training data can easily disrupt the model's original generative capabilities. To address this challenge, we propose a guidance-decoupled prior preservation mechanism to achieve high generative quality and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Topic Modeling · Domain Adaptation and Few-Shot Learning
MethodsDiffusion
