OrienText: Surface Oriented Textual Image Generation

Shubham Singh Paliwal; Arushi Jain; Monika Sharma; Vikram Jamwal; Lovekesh Vig

arXiv:2505.20958·cs.CV·May 28, 2025

OrienText: Surface Oriented Textual Image Generation

Shubham Singh Paliwal, Arushi Jain, Monika Sharma, Vikram Jamwal, Lovekesh Vig

PDF

TL;DR

OrienText is a novel diffusion-based method that improves text placement and orientation accuracy on complex surfaces in images, enhancing applications in e-commerce and advertising.

Contribution

It introduces the use of region-specific surface normals as conditional input to guide text placement in diffusion models, addressing a key challenge in text-to-image generation.

Findings

01

Effective text rendering on complex surfaces demonstrated

02

Outperforms existing textual image generation methods

03

Improves text orientation accuracy in generated images

Abstract

Textual content in images is crucial in e-commerce sectors, particularly in marketing campaigns, product imaging, advertising, and the entertainment industry. Current text-to-image (T2I) generation diffusion models, though proficient at producing high-quality images, often struggle to incorporate text accurately onto complex surfaces with varied perspectives, such as angled views of architectural elements like buildings, banners, or walls. In this paper, we introduce the Surface Oriented Textual Image Generation (OrienText) method, which leverages region-specific surface normals as conditional input to T2I generation diffusion model. Our approach ensures accurate rendering and correct orientation of the text within the image context. We demonstrate the effectiveness of the OrienText method on a self-curated dataset of images and compare it against the existing textual image generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.