OCR accuracy improvement on document images through a novel   pre-processing approach

Abdeslam El Harraj; Naoufal Raissouni

arXiv:1509.03456·cs.CV·September 14, 2015

OCR accuracy improvement on document images through a novel pre-processing approach

Abdeslam El Harraj, Naoufal Raissouni

PDF

TL;DR

This paper introduces a novel, unsupervised pre-processing pipeline that enhances document images by correcting distortions and improving illumination, leading to significant improvements in OCR accuracy and text detection rates.

Contribution

The paper presents a new nonparametric, unsupervised method combining multiple image enhancement techniques specifically designed to improve OCR performance on distorted document images.

Findings

01

Significant increase in OCR accuracy on standard datasets.

02

Effective handling of lighting variations and distortions.

03

Improved text detection rates compared to baseline methods.

Abstract

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or text-photo images and natural images, leading to an unreliable OCR digitization. In this paper, we present a novel nonparametric and unsupervised method to compensate for undesirable document image distortions aiming to optimally improve OCR accuracy. Our approach relies on a very efficient stack of document image enhancing techniques to recover deformation of the entire document image. First, we propose a local brightness and contrast adjustment method to effectively handle lighting variations and the irregular distribution of image illumination. Second, we use an optimized greyscale conversion algorithm to transform our document image to greyscale level.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.