EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou; Cong Yao; He Wen; Yuzhi Wang; Shuchang Zhou; Weiran He,; Jiajun Liang

arXiv:1704.03155·cs.CV·July 11, 2017·115 cites

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He,, Jiajun Liang

PDF

Open Access 5 Repos 1 Models

TL;DR

EAST introduces a streamlined scene text detection pipeline that directly predicts oriented text regions in images, achieving superior accuracy and speed compared to previous methods.

Contribution

The paper presents a simple, single neural network pipeline for scene text detection that eliminates intermediate steps, improving both efficiency and accuracy.

Findings

01

Achieves an F-score of 0.7820 on ICDAR 2015

02

Runs at 13.2 fps on 720p images

03

Outperforms state-of-the-art methods in accuracy and speed

Abstract

Previous approaches for scene text detection have already achieved promising performances across various benchmarks. However, they usually fall short when dealing with challenging scenarios, even when equipped with deep neural network models, because the overall performance is determined by the interplay of multiple stages and components in the pipelines. In this work, we propose a simple yet powerful pipeline that yields fast and accurate text detection in natural scenes. The pipeline directly predicts words or text lines of arbitrary orientations and quadrilateral shapes in full images, eliminating unnecessary intermediate steps (e.g., candidate aggregation and word partitioning), with a single neural network. The simplicity of our pipeline allows concentrating efforts on designing loss functions and neural network architecture. Experiments on standard datasets including ICDAR 2015,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
tangjicheng123/triton_Part_1-model_deployment
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Advanced Image and Video Retrieval Techniques