Confidence Score for Unsupervised Foreground Background Separation of   Document Images

Soumyadeep Dey; Pratik Jawanpuria

arXiv:2204.04044·cs.CV·April 11, 2022

Confidence Score for Unsupervised Foreground Background Separation of Document Images

Soumyadeep Dey, Pratik Jawanpuria

PDF

Open Access

TL;DR

This paper introduces a new method to compute confidence scores for unsupervised foreground-background separation in document images, enhancing the interpretability and utility of binarization algorithms without increasing computational complexity.

Contribution

The paper presents a novel approach for confidence scoring in unsupervised document image binarization that maintains the same computational complexity as existing methods.

Findings

01

Confidence scores improve document binarization quality.

02

Scores assist in document cleanup and texture addition tasks.

03

Method is computationally efficient and compatible with existing algorithms.

Abstract

Foreground-background separation is an important problem in document image analysis. Popular unsupervised binarization methods (such as the Sauvola's algorithm) employ adaptive thresholding to classify pixels as foreground or background. In this work, we propose a novel approach for computing confidence scores of the classification in such algorithms. This score provides an insight of the confidence level of the prediction. The computational complexity of the proposed approach is the same as the underlying binarization algorithm. Our experiments illustrate the utility of the proposed scores in various applications like document binarization, document image cleanup, and texture addition.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Handwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques