DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael, Rubinstein, Kfir Aberman

TL;DR
DreamBooth introduces a fine-tuning method for text-to-image diffusion models that personalizes them to generate photorealistic images of specific subjects in diverse contexts using only a few reference images.
Contribution
The paper presents a novel fine-tuning approach with a class-specific prior preservation loss for subject-driven image generation, enabling high-quality personalization with minimal data.
Findings
Effective subject recontextualization and view synthesis.
Preserves key features of subjects across diverse scenes.
Outperforms previous methods in personalized image generation.
Abstract
Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts. In this work, we present a new approach for "personalization" of text-to-image diffusion models. Given as input just a few images of a subject, we fine-tune a pretrained text-to-image model such that it learns to bind a unique identifier with that specific subject. Once the subject is embedded in the output domain of the model, the unique identifier can be used to synthesize novel photorealistic images of the subject contextualized in different scenes. By leveraging the semantic prior embedded in the model with a new autogenous class-specific prior preservation loss, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗FredZhang7/paint-journey-v2model· 92 dl· ♡ 3692 dl♡ 36
- 🤗alkzar90/ppaine-landscapemodel· 6 dl· ♡ 166 dl♡ 16
- 🤗PAIR/text2video-zero-controlnet-canny-gta5model· 14 dl· ♡ 1414 dl♡ 14
- 🤗PAIR/text2video-zero-controlnet-canny-animemodel· 14 dl· ♡ 2014 dl♡ 20
- 🤗PAIR/text2video-zero-controlnet-canny-arcanemodel· 5 dl· ♡ 315 dl♡ 31
- 🤗PAIR/text2video-zero-controlnet-canny-avatarmodel· 7 dl· ♡ 107 dl♡ 10
- 🤗drmeeseeks/dreambooth_diffusion_modelmodel· 1 dl1 dl
- 🤗drmeeseeks/dreambooth_diffusion_model-v2model· 1 dl1 dl
- 🤗drmeeseeks/dreambooth_diffusion_model-v3model· 5 dl5 dl
- 🤗atharva98/vwcar-mk8-loramodel· ♡ 1♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Image Processing and 3D Reconstruction
MethodsDiffusion
