TL;DR
This paper introduces UDBNET, an unsupervised document binarization method using a three-player adversarial game, which generates pseudo paired data to improve binarization without requiring labeled datasets.
Contribution
The paper proposes a novel three-player adversarial framework for unsupervised document binarization, enlarging datasets and improving performance without paired training data.
Findings
Outperforms state-of-the-art on DIBCO datasets
Generates diverse degraded images for training
Enforces distribution alignment via joint discriminator
Abstract
Degraded document image binarization is one of the most challenging tasks in the domain of document image analysis. In this paper, we present a novel approach towards document image binarization by introducing three-player min-max adversarial game. We train the network in an unsupervised setup by assuming that we do not have any paired-training data. In our approach, an Adversarial Texture Augmentation Network (ATANet) first superimposes the texture of a degraded reference image over a clean image. Later, the clean image along with its generated degraded version constitute the pseudo paired-data which is used to train the Unsupervised Document Binarization Network (UDBNet). Following this approach, we have enlarged the document binarization datasets as it generates multiple images having same content feature but different textual feature. These generated noisy images are then fed into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
