TIPO: Text to Image with Text Presampling for Prompt Optimization
Shih-Ying Yeh, Yi Li, Sang-Hyun Park, Giyeong Oh, Xuehai Wang, Min Song, Youngjae Yu, Shang-Hong Lai

TL;DR
TIPO presents an efficient, scalable method for automatic prompt refinement in text-to-image generation, significantly improving visual quality and coherence without relying on large language models or reinforcement learning.
Contribution
Introduces a lightweight prompt expansion technique that enhances prompt detail and quality, outperforming resource-intensive methods in T2I tasks.
Findings
Achieves higher human preference rates
Reduces visual artifacts
Maintains competitive aesthetic quality
Abstract
TIPO (Text-to-Image Prompt Optimization) introduces an efficient approach for automatic prompt refinement in text-to-image (T2I) generation. Starting from simple user prompts, TIPO leverages a lightweight pre-trained model to expand these prompts into richer and more detailed versions. Conceptually, TIPO samples refined prompts from a targeted sub-distribution within the broader semantic space, preserving the original intent while significantly improving visual quality, coherence, and detail. Unlike resource-intensive methods based on large language models (LLMs) or reinforcement learning (RL), TIPO offers strong computational efficiency and scalability, opening new possibilities for effective automated prompt engineering in T2I tasks. Extensive experiments across multiple domains demonstrate that TIPO achieves stronger text alignment, reduced visual artifacts, and consistently higher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗KBlueLeaf/TIPO-200Mmodel· 251 dl· ♡ 5251 dl♡ 5
- 🤗KBlueLeaf/TIPO-500Mmodel· 3.3k dl· ♡ 563.3k dl♡ 56
- 🤗KBlueLeaf/TIPO-200M-ftmodel· 546 dl· ♡ 20546 dl♡ 20
- 🤗KBlueLeaf/TIPO-100Mmodel· 443 dl· ♡ 8443 dl♡ 8
- 🤗KBlueLeaf/TIPO-200M-ft2model· 858 dl· ♡ 24858 dl♡ 24
- 🤗KBlueLeaf/TIPO-500M-ftmodel· 25k dl· ♡ 4525k dl♡ 45
- 🤗RichardErkhov/KBlueLeaf_-_TIPO-500M-ggufmodel· 26 dl26 dl
- 🤗RichardErkhov/KBlueLeaf_-_TIPO-200M-ft-ggufmodel· 57 dl57 dl
- 🤗RichardErkhov/KBlueLeaf_-_TIPO-200M-ggufmodel· 23 dl23 dl
- 🤗RichardErkhov/KBlueLeaf_-_TIPO-100M-ggufmodel· 36 dl· ♡ 136 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques
