A maximal-information color to gray conversion method for document images: Toward an optimal grayscale representation for document image binarization
Reza Farrahi Moghaddam, Shaohua Chen, Rachid Hedjam, Mohamed Cheriet

TL;DR
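This paper introduces a color-to-gray conversion method for document images that is designed to improve binarization: a data-driven transform balances the information across color channels, and the channel that is most homogeneous on text regions is kept as the gray representation, giving contrast enhancement that does not depend on luminance.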
Contribution
The proposed dual transform balances the amount of information across color channels. It is combined with a color reduction preprocessing step and a homogeneity-based channel selection to produce a gray image better suited to document binarization.
Findings
Enhanced binarization performance on images from the ICDAR'03 Robust Reading, KAIST, and DIBCO'09 datasets
Improved contrast and readability in grayscale images
Effective color reduction and channel selection strategy
Abstract
A novel method to convert color/multi-spectral images to gray-level images is introduced to increase the performance of document binarization methods. The method uses the distribution of the pixel data of the input document image in a color space to find a transformation, called the dual transform, which balances the amount of information on all color channels. Furthermore, in order to reduce the intensity variations on the gray output, a color reduction preprocessing step is applied. Then, a channel is selected as the gray value representation of the document image based on a homogeneity criterion on the text regions. In this way, the proposed method can provide a luminance-independent contrast enhancement. The performance of the method is evaluated against various images from three datasets, the ICDAR'03 Robust Reading, the KAIST, and the DIBCO'09 datasets, subjectively and…
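The three stages named in the abstract (color reduction, a channel-balancing transform, and homogeneity-based channel selection) can be illustrated with a rough sketch. This is not the authors' algorithm: the abstract does not define the dual transform, so PCA whitening is swapped in as a stand-in that equalizes variance across decorrelated channels. Uniform quantization stands in for the color reduction step, and "homogeneity" is taken to mean the lowest standard deviation over text pixels. The function names, the `levels` parameter, the ordering of the stages, and the `text_mask` input (a boolean map of known text pixels) are all hypothetical, and the image is assumed to have at least two channels.

```python
import numpy as np


def reduce_colors(pixels, levels=4):
    """Quantize each channel to `levels` evenly spaced values in [0, 1].

    Assumption: plain uniform quantization stands in for the paper's
    color reduction preprocessing.
    """
    lo = pixels.min(axis=0)
    hi = pixels.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid dividing by zero
    bins = np.minimum(np.floor((pixels - lo) / span * levels), levels - 1)
    return (bins + 0.5) / levels


def balance_channels(pixels):
    """PCA whitening: decorrelated channels with equal (unit) variance.

    Assumption: a stand-in for the dual transform, NOT the authors'
    formulation. Directions with numerically zero variance are zeroed.
    """
    centered = pixels - pixels.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(centered, rowvar=False))
    keep = (eigvals > 0) & (eigvals > 1e-10 * eigvals.max())
    safe = np.where(keep, eigvals, 1.0)
    scale = np.where(keep, 1.0 / np.sqrt(safe), 0.0)
    return (centered @ eigvecs) * scale


def to_gray(image, text_mask, levels=4):
    """Return (gray image in [0, 1], index of the selected channel)."""
    h, w, c = image.shape
    mask = np.asarray(text_mask, dtype=bool).reshape(-1)
    if not mask.any():
        raise ValueError("text_mask must mark at least one pixel")
    pixels = image.reshape(-1, c).astype(float)
    balanced = balance_channels(reduce_colors(pixels, levels))
    # Homogeneity proxy (assumption): lowest spread over text pixels.
    text_std = balanced[mask].std(axis=0)
    text_std[balanced.std(axis=0) < 0.5] = np.inf  # skip zeroed channels
    best = int(np.argmin(text_std))
    chan = balanced[:, best]
    span = chan.max() - chan.min()
    gray = (chan - chan.min()) / (span if span > 0 else 1.0)
    return gray.reshape(h, w), best
```

Calling `to_gray(image, text_mask)` on an `(H, W, 3)` array returns the gray image and the index of the chosen channel. Because the sign of each whitened channel is arbitrary, text may come out lighter or darker than the background, so a downstream binarization step would need to handle either polarity.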
Taxonomy
Topics: Handwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Vehicle License Plate Recognition
