ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting
Yuliang Liu, Chunhua Shen, Lianwen Jin, Tong He, Peng Chen, Chongyu, Liu, Hao Chen

TL;DR
ABCNet v2 introduces an adaptive Bezier curve approach for real-time, end-to-end text spotting of arbitrarily-shaped text, improving accuracy and simplicity over previous segmentation-based methods.
Contribution
The paper presents a novel Bezier curve fitting method, a BezierAlign layer, and a coordinate convolution technique, enabling structured, accurate, and efficient text detection and recognition.
Findings
Achieves state-of-the-art performance on bilingual datasets.
Maintains high efficiency with a simple pipeline.
Significantly improves recognition precision over previous methods.
Abstract
End-to-end text-spotting, which aims to integrate detection and recognition in a unified framework, has attracted increasing attention due to its simplicity of the two complimentary tasks. It remains an open problem especially when processing arbitrarily-shaped text instances. Previous methods can be roughly categorized into two groups: character-based and segmentation-based, which often require character-level annotations and/or complex post-processing due to the unstructured output. Here, we tackle end-to-end text spotting by presenting Adaptive Bezier Curve Network v2 (ABCNet v2). Our main contributions are four-fold: 1) For the first time, we adaptively fit arbitrarily-shaped text by a parameterized Bezier curve, which, compared with segmentation-based methods, can not only provide structured output but also controllable representation. 2) We design a novel BezierAlign layer for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsBezierAlign · Adaptive Bezier-Curve Network · Convolution
