Arbitrary-Shaped Text Detection withAdaptive Text Region Representation
Xiufeng Jiang, Shugong Xu (Fellow, IEEE), Shunqing Zhang (Senior, Member, IEEE), and Shan Cao

TL;DR
This paper introduces a novel adaptive text region representation method that enables precise detection of arbitrarily shaped and densely packed text instances in images, overcoming limitations of traditional rectangular approaches.
Contribution
The paper proposes a new text region representation and detection pipeline that accurately detects dense, arbitrarily shaped text regions using adaptive central masks and expansion ratios.
Findings
Effective detection of arbitrarily shaped text instances.
Superior performance on standard datasets.
Precise detection of closely adjacent text.
Abstract
Text detection/localization, as an important task in computer vision, has witnessed substantialadvancements in methodology and performance with convolutional neural networks. However, the vastmajority of popular methods use rectangles or quadrangles to describe text regions. These representationshave inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries,which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text regionrepresentation method, with a robust pipeline, which can precisely detect dense adjacent text instances witharbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask anda corresponding expanding ratio between the central text region and the full text region. More specifically,our pipeline generates adaptive central text regions and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
