Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual   Text Processing

Yan Shu; Weichao Zeng; Zhenhang Li; Fangmin Zhao; Yu Zhou

arXiv:2402.03082·cs.CV·February 6, 2024·1 cites

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Yan Shu, Weichao Zeng, Zhenhang Li, Fangmin Zhao, Yu Zhou

PDF

Open Access

TL;DR

This comprehensive survey reviews recent advancements in visual text processing, emphasizing the integration of textual features, datasets, and challenges, to guide future research in this evolving field.

Contribution

It provides a hierarchical taxonomy, discusses textual feature integration, benchmarks methods on datasets, and highlights future research directions in visual text processing.

Findings

01

Effective use of textual features enhances processing tasks.

02

Benchmarking reveals strengths and gaps in current methods.

03

Identifies key challenges and future research avenues.

Abstract

Visual text, a pivotal element in both document and scene images, speaks volumes and attracts significant attention in the computer vision domain. Beyond visual text detection and recognition, the field of visual text processing has experienced a surge in research, driven by the advent of fundamental generative models. However, challenges persist due to the unique properties and features that distinguish text from general objects. Effectively leveraging these unique textual characteristics is crucial in visual text processing, as observed in our study. In this survey, we present a comprehensive, multi-perspective analysis of recent advancements in this field. Initially, we introduce a hierarchical taxonomy encompassing areas ranging from text image enhancement and restoration to text image manipulation, followed by different learning paradigms. Subsequently, we conduct an in-depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques