TL;DR
DE-GAN is a novel conditional GAN framework designed to restore severely degraded document images, significantly improving OCR performance across various degradation types and outperforming existing methods on standard datasets.
Contribution
This paper introduces DE-GAN, the first application of conditional GANs for comprehensive document enhancement tasks, demonstrating superior restoration quality.
Findings
Outperforms state-of-the-art methods on DIBCO and H-DIBCO datasets.
Effectively restores degraded documents across multiple degradation types.
Enhances OCR accuracy by improving document image quality.
Abstract
Documents often exhibit various forms of degradation, which make it hard to be read and substantially deteriorate the performance of an OCR system. In this paper, we propose an effective end-to-end framework named Document Enhancement Generative Adversarial Networks (DE-GAN) that uses the conditional GANs (cGANs) to restore severely degraded document images. To the best of our knowledge, this practice has not been studied within the context of generative adversarial deep networks. We demonstrate that, in different tasks (document clean up, binarization, deblurring and watermark removal), DE-GAN can produce an enhanced version of the degraded document with a high quality. In addition, our approach provides consistent improvements compared to state-of-the-art methods over the widely used DIBCO 2013, DIBCO 2017 and H-DIBCO 2018 datasets, proving its ability to restore a degraded document…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDE-GAN: A Conditional Generative Adversarial Network for Document Enhancement
