Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu, Weichao Zeng, Zhenhang Li, Fangmin Zhao, Yu Zhou

TL;DR
This comprehensive survey reviews recent advancements in visual text processing, emphasizing the integration of textual features, datasets, and challenges, to guide future research in this evolving field.
Contribution
It provides a hierarchical taxonomy, discusses textual feature integration, benchmarks methods on datasets, and highlights future research directions in visual text processing.
Findings
Effective use of textual features enhances processing tasks.
Benchmarking reveals strengths and gaps in current methods.
Identifies key challenges and future research avenues.
Abstract
Visual text, a pivotal element in both document and scene images, speaks volumes and attracts significant attention in the computer vision domain. Beyond visual text detection and recognition, the field of visual text processing has experienced a surge in research, driven by the advent of fundamental generative models. However, challenges persist due to the unique properties and features that distinguish text from general objects. Effectively leveraging these unique textual characteristics is crucial in visual text processing, as observed in our study. In this survey, we present a comprehensive, multi-perspective analysis of recent advancements in this field. Initially, we introduce a hierarchical taxonomy encompassing areas ranging from text image enhancement and restoration to text image manipulation, followed by different learning paradigms. Subsequently, we conduct an in-depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
