Enhance to Read Better: A Multi-Task Adversarial Network for Handwritten Document Image Enhancement
Sana Khamekhem Jemni, Mohamed Ali Souibgui, Yousri Kessentini, and Alicia Forn\'es

TL;DR
This paper introduces a novel multi-task GAN architecture that enhances degraded handwritten document images by integrating a text recognizer, significantly improving readability and visual quality, and outperforming existing methods in benchmark challenges.
Contribution
It is the first to incorporate text recognition into GAN-based document binarization, improving both readability and visual quality of degraded handwritten documents.
Findings
Improved readability of degraded handwritten documents.
Outperformed state-of-the-art in H-DIBCO challenges.
Effective on Arabic and Latin handwritten datasets.
Abstract
Handwritten document images can be highly affected by degradation for different reasons: Paper ageing, daily-life scenarios (wrinkles, dust, etc.), bad scanning process and so on. These artifacts raise many readability issues for current Handwritten Text Recognition (HTR) algorithms and severely devalue their efficiency. In this paper, we propose an end to end architecture based on Generative Adversarial Networks (GANs) to recover the degraded documents into a clean and readable form. Unlike the most well-known document binarization methods, which try to improve the visual quality of the degraded document, the proposed architecture integrates a handwritten text recognizer that promotes the generated document image to be more readable. To the best of our knowledge, this is the first work to use the text information while binarizing handwritten documents. Extensive experiments conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Infrastructure Maintenance and Monitoring · Vehicle License Plate Recognition
