Deformation Robust Text Spotting with Geometric Prior

Xixuan Hao; Aozhong Zhang; Xianze Meng; Bin Fu

arXiv:2308.16404·cs.CV·September 1, 2023

Deformation Robust Text Spotting with Geometric Prior

Xixuan Hao, Aozhong Zhang, Xianze Meng, Bin Fu

PDF

Open Access

TL;DR

This paper introduces a new dataset and a novel method for text spotting that is robust to deformation and font diversity, improving recognition of complex character shapes in natural images.

Contribution

The paper presents ARText, a large dataset with deformed and diverse fonts, and DR TextSpotter, a method using geometric priors and graph convolution for robust text recognition.

Findings

01

Effective on ARText and IC19-ReCTS datasets

02

Improves recognition of deformed and diverse fonts

03

Outperforms existing methods in robustness

Abstract

The goal of text spotting is to perform text detection and recognition in an end-to-end manner. Although the diversity of luminosity and orientation in scene texts has been widely studied, the font diversity and shape variance of the same character are ignored in recent works, since most characters in natural images are rendered in standard fonts. To solve this problem, we present a Chinese Artistic Dataset, termed as ARText, which contains 33,000 artistic images with rich shape deformation and font diversity. Based on this database, we develop a deformation robust text spotting method (DR TextSpotter) to solve the recognition problem of complex deformation of characters in different fonts. Specifically, we propose a geometric prior module to highlight the important features based on the unsupervised landmark detection sub-network. A graph convolution network is further constructed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Human Motion and Animation

MethodsConvolution