StyleDrop: Text-to-Image Generation in Any Style

Kihyuk Sohn; Nataniel Ruiz; Kimin Lee; Daniel Castro Chin; Irina Blok,; Huiwen Chang; Jarred Barber; Lu Jiang; Glenn Entis; Yuanzhen Li; Yuan Hao,; Irfan Essa; Michael Rubinstein; Dilip Krishnan

arXiv:2306.00983·cs.CV·June 2, 2023·25 cites

StyleDrop: Text-to-Image Generation in Any Style

Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok,, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao,, Irfan Essa, Michael Rubinstein, Dilip Krishnan

PDF

Open Access 4 Repos 2 Models

TL;DR

StyleDrop is a versatile method that enables text-to-image models to faithfully generate images in specific styles, capturing detailed nuances with minimal training data and parameters.

Contribution

It introduces StyleDrop, a novel fine-tuning approach that efficiently learns and reproduces complex styles using very few parameters and minimal data, outperforming existing methods.

Findings

01

StyleDrop outperforms DreamBooth and textual inversion in style transfer quality.

02

It effectively captures detailed style nuances from a single image.

03

The method requires less than 1% of model parameters for training.

Abstract

Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts. However, ambiguities inherent in natural language and out-of-distribution effects make it hard to synthesize image styles, that leverage a specific design pattern, texture or material. In this paper, we introduce StyleDrop, a method that enables the synthesis of images that faithfully follow a specific style using a text-to-image model. The proposed method is extremely versatile and captures nuances and details of a user-provided style, such as color schemes, shading, design patterns, and local and global effects. It efficiently learns a new style by fine-tuning very few trainable parameters (less than $1%$ of total model parameters) and improving the quality via iterative training with either human or automated feedback. Better yet, StyleDrop is able to deliver impressive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · Computer Graphics and Visualization Techniques