Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan, Xin Zhou, Hao Tian

TL;DR
This paper introduces a novel style guidance technique for diffusion-based text-to-image models, enabling arbitrary style control using reference images or self-guidance, enhancing diversity and style fidelity without additional style transfer models.
Contribution
It presents a new style guidance method that allows arbitrary style specification in diffusion models without needing separate style transfer, improving flexibility and diversity.
Findings
Effective in diverse graphic art styles
Maintains high image quality with style control
Robust across various models and content types
Abstract
Diffusion-based text-to-image generation models like GLIDE and DALLE-2 have gained wide success recently for their superior performance in turning complex text inputs into images of high quality and wide diversity. In particular, they are proven to be very powerful in creating graphic arts of various formats and styles. Although current models supported specifying style formats like oil painting or pencil drawing, fine-grained style features like color distributions and brush strokes are hard to specify as they are randomly picked from a conditional distribution based on the given text input. Here we propose a novel style guidance method to support generating images using arbitrary style guided by a reference image. The generation method does not require a separate style transfer model to generate desired styles while maintaining image quality in generated content as controlled by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation· youtube
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation· youtube
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Computer Graphics and Visualization Techniques
MethodsDiffusion · Guided Language to Image Diffusion for Generation and Editing
