SPG: Style-Prompting Guidance for Style-Specific Content Creation

Qian Liang; Zichong Chen; Yang Zhou; Hui Huang

arXiv:2508.11476·cs.GR·August 18, 2025

SPG: Style-Prompting Guidance for Style-Specific Content Creation

Qian Liang, Zichong Chen, Yang Zhou, Hui Huang

PDF

TL;DR

This paper introduces Style-Prompting Guidance (SPG), a new sampling strategy for controlling visual style in text-to-image diffusion models, improving style consistency without sacrificing semantic accuracy.

Contribution

The paper proposes SPG, a novel style control method that guides diffusion models using style noise vectors and integrates seamlessly with existing guidance frameworks.

Findings

01

SPG enhances style consistency in generated images.

02

The method maintains high semantic fidelity.

03

It is compatible with various controllable diffusion frameworks.

Abstract

Although recent text-to-image (T2I) diffusion models excel at aligning generated images with textual prompts, controlling the visual style of the output remains a challenging task. In this work, we propose Style-Prompting Guidance (SPG), a novel sampling strategy for style-specific image generation. SPG constructs a style noise vector and leverages its directional deviation from unconditional noise to guide the diffusion process toward the target style distribution. By integrating SPG with Classifier-Free Guidance (CFG), our method achieves both semantic fidelity and style consistency. SPG is simple, robust, and compatible with controllable frameworks like ControlNet and IPAdapter, making it practical and widely applicable. Extensive experiments demonstrate the effectiveness and generality of our approach compared to state-of-the-art methods. Code is available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.