Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo, Zixun Xia, Liaojun Chen, Zhenhang Li, Weichao Zeng,, Jianye Wang, Wentao Cheng, Yaxing Wang, Yu Zhou, Jian Yang

TL;DR
This paper introduces STGen, a training-free framework that improves visual text generation in complex layouts like slanted or curved texts by harmonizing text with backgrounds using semantic rectification and structure injection.
Contribution
STGen is a novel training-free approach that decomposes visual text generation into semantic rectification and structure reinforcement, effectively handling challenging text layouts.
Findings
Achieves higher accuracy in complex text layouts
Produces more harmonious and visually appealing text images
Outperforms existing methods in quality and fidelity
Abstract
In real-world images, slanted or curved texts, especially those on cans, banners, or badges, appear as frequently, if not more so, than flat texts due to artistic design or layout constraints. While high-quality visual text generation has become available with the advanced generative capabilities of diffusion models, these models often produce distorted text and inharmonious text background when given slanted or curved text layouts due to training data limitation. In this paper, we introduce a new training-free framework, STGen, which accurately generates visual texts in challenging scenarios (\eg, slanted or curved text layouts) while harmonizing them with the text background. Our framework decomposes the visual text generation process into two branches: (i) \textbf{Semantic Rectification Branch}, which leverages the ability in generating flat but accurate visual texts of the model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Humanities and Scholarship · Artificial Intelligence in Games · Digital Games and Media
MethodsDiffusion
