Benchmarking recognition results on word image datasets

Deepak Kumar; M N Anil Prasad; A G Ramakrishnan

arXiv:1208.6137·cs.CV·August 31, 2012·5 cites

Benchmarking recognition results on word image datasets

Deepak Kumar, M N Anil Prasad, A G Ramakrishnan

PDF

Open Access

TL;DR

This paper benchmarks the maximum recognition accuracy on various word image datasets using manual segmentation and commercial OCR, highlighting the impact of image degradations and annotation methods.

Contribution

It introduces a semi-automated pixel-level segmentation tool and provides a comprehensive benchmark across multiple datasets with different image qualities.

Findings

01

Recognition rates range from 79.6% to 89.3% across datasets.

02

Pixel-level annotation offers advantages for recognition accuracy.

03

Degradations and distortions significantly affect OCR performance.

Abstract

We have benchmarked the maximum obtainable recognition accuracy on various word image datasets using manual segmentation and a currently available commercial OCR. We have developed a Matlab program, with graphical user interface, for semi-automated pixel level segmentation of word images. We discuss the advantages of pixel level annotation. We have covered five databases adding up to over 3600 word images. These word images have been cropped from camera captured scene, born-digital and street view images. We recognize the segmented word image using the trial version of Nuance Omnipage OCR. We also discuss, how the degradations introduced during acquisition or inaccuracies introduced during creation of word images affect the recognition of the word present in the image. Word images for different kinds of degradations and correction for slant and curvy nature of words are also discussed.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction