UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
Yiming Zhao, Yuanpeng Gao, Yuxuan Luo, Jiwei Duan, Shisong Lin, Longfei Xiong, Zhouhui Lian

TL;DR
UTDesign is a comprehensive framework that enhances stylized text editing and generation in graphic design images, supporting multiple scripts and integrating into automated design pipelines with state-of-the-art results.
Contribution
The paper introduces a novel DiT-based style transfer model and a multi-modal conditional text generation system, advancing high-precision, style-consistent text editing and synthesis in design images.
Findings
Achieves state-of-the-art stylistic consistency and text accuracy
Supports both English and Chinese scripts effectively
Outperforms existing open-source and commercial methods
Abstract
AI-assisted graphic design has emerged as a powerful tool for automating the creation and editing of design elements such as posters, banners, and advertisements. While diffusion-based text-to-image models have demonstrated strong capabilities in visual content generation, their text rendering performance, particularly for small-scale typography and non-Latin scripts, remains limited. In this paper, we propose UTDesign, a unified framework for high-precision stylized text editing and conditional text generation in design images, supporting both English and Chinese scripts. Our framework introduces a novel DiT-based text style transfer model trained from scratch on a synthetic dataset, capable of generating transparent RGBA text foregrounds that preserve the style of reference glyphs. We further extend this model into a conditional text generation framework by training a multi-modal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
