Focus-Enhanced Scene Text Recognition with Deformable Convolutions

Linjie Deng; Yanxiang Gong; Xinchen Lu; Xin Yi; Zheng Ma; Mei Xie

arXiv:1908.10998·cs.CV·May 6, 2022

Focus-Enhanced Scene Text Recognition with Deformable Convolutions

Linjie Deng, Yanxiang Gong, Xinchen Lu, Xin Yi, Zheng Ma, Mei Xie

PDF

1 Repo

TL;DR

This paper introduces a focus-enhanced scene text recognition method using deformable convolutions, effectively handling irregular and distorted text without explicit rectification, and demonstrates strong performance on benchmark datasets.

Contribution

The work proposes a novel recognition network that leverages deformable convolutions to adapt to irregular text shapes without rectification steps.

Findings

01

Achieves state-of-the-art results on public benchmarks.

02

Effectively recognizes irregular and distorted text.

03

Demonstrates the effectiveness of deformable convolutions in scene text recognition.

Abstract

Recently, scene text recognition methods based on deep learning have sprung up in computer vision area. The existing methods achieved great performances, but the recognition of irregular text is still challenging due to the various shapes and distorted patterns. Consider that at the time of reading words in the real world, normally we will not rectify it in our mind but adjust our focus and visual fields. Similarly, through utilizing deformable convolutional layers whose geometric structures are adjustable, we present an enhanced recognition network without the steps of rectification to deal with irregular text in this work. A number of experiments have been applied, where the results on public benchmarks demonstrate the effectiveness of our proposed components and shows that our method has reached satisfactory performances. The code will be publicly available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Alpaca07/dtr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.