TextEditBench: Evaluating Reasoning-aware Text Editing Beyond Rendering
Rui Gui, Yang Wan, Haochen Han, Dongxing Mao, Fangming Liu, Min Li, Alex Jinpeng Wang

TL;DR
TextEditBench is a new benchmark designed to evaluate reasoning-aware text editing in images, focusing on semantic, geometric, and contextual coherence, revealing current models' limitations in complex editing scenarios.
Contribution
We introduce TextEditBench, a comprehensive benchmark with a novel Semantic Expectation dimension to assess reasoning and coherence in text editing within images.
Findings
Current models excel at simple instructions
Models struggle with context-dependent reasoning
Physical and layout consistency remain challenging
Abstract
Text rendering has recently emerged as one of the most challenging frontiers in visual generation, drawing significant attention from large-scale diffusion and multimodal models. However, text editing within images remains largely unexplored, as it requires generating legible characters while preserving semantic, geometric, and contextual coherence. To fill this gap, we introduce TextEditBench, a comprehensive evaluation benchmark that explicitly focuses on text-centric regions in images. Beyond basic pixel manipulations, our benchmark emphasizes reasoning-intensive editing scenarios that require models to understand physical plausibility, linguistic meaning, and cross-modal dependencies. We further propose a novel evaluation dimension, Semantic Expectation (SE), which measures reasoning ability of model to maintain semantic consistency, contextual coherence, and cross-modal alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Digital Humanities and Scholarship
