StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Synthesis

Peter Schaldenbrand; Zhixuan Liu; Jean Oh

arXiv:2111.03133·cs.CV·March 2, 2022·1 cites

StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Synthesis

Peter Schaldenbrand, Zhixuan Liu, Jean Oh

PDF

Open Access 1 Repo

TL;DR

StyleCLIPDraw enhances text-to-drawing synthesis by integrating style control, enabling artistic customization of generated images in both texture and shape through a coupled style transfer approach.

Contribution

It introduces a novel coupled style loss to the CLIPDraw model, allowing simultaneous control of content and style in generated drawings.

Findings

01

Coupled style transfer captures style in texture and shape.

02

The method improves artistic control over generated images.

03

Code and more results are publicly available.

Abstract

Generating images that fit a given text description using machine learning has improved greatly with the release of technologies such as the CLIP image-text encoder model; however, current methods lack artistic control of the style of image to be generated. We introduce StyleCLIPDraw which adds a style loss to the CLIPDraw text-to-drawing synthesis model to allow artistic control of the synthesized drawings in addition to control of the content via text. Whereas performing decoupled style transfer on a generated image only affects the texture, our proposed coupled approach is able to capture a style in both texture and shape, suggesting that the style of the drawing is coupled with the drawing process itself. More results and our code are available at https://github.com/pschaldenbrand/StyleCLIPDraw

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pschaldenbrand/styleclipdraw
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Human Motion and Animation

MethodsContrastive Language-Image Pre-training