Image preprocessing and modified adaptive thresholding for improving OCR

Rohan Lal Kshetry

arXiv:2111.14075·cs.CV·February 10, 2022

Image preprocessing and modified adaptive thresholding for improving OCR

Rohan Lal Kshetry

PDF

Open Access

TL;DR

This paper introduces a novel image preprocessing technique that enhances OCR accuracy by isolating text features and applying adaptive thresholding based on pixel intensity, validated through experiments with PyTesseract.

Contribution

The proposed method uniquely combines feature removal and adaptive thresholding based on pixel intensity to improve OCR performance.

Findings

01

Enhanced OCR accuracy with the proposed preprocessing method.

02

Effective isolation of text features improves thresholding results.

03

Validated improvements using PyTesseract on processed images.

Abstract

In this paper I have proposed a method to find the major pixel intensity inside the text and thresholding an image accordingly to make it easier to be used for optical character recognition (OCR) models. In our method, instead of editing whole image, I are removing all other features except the text boundaries and the color filling them. In this approach, the grayscale intensity of the letters from the input image are used as one of thresholding parameters. The performance of the developed model is finally validated with input images, with and without image processing followed by OCR by PyTesseract. Based on the results obtained, it can be observed that this algorithm can be efficiently applied in the field of image processing for OCR.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Image and Video Stabilization