GL-PGENet: A Parameterized Generation Framework for Robust Document Image Enhancement
Zhihong Tang

TL;DR
GL-PGENet is a novel multi-degradation document image enhancement framework that combines hierarchical global-local refinement, parametric generation, and dense feature fusion to achieve state-of-the-art results and robustness in real-world applications.
Contribution
The paper introduces a hierarchical enhancement architecture with parametric generation and a dense fusion network, improving robustness and generalization for multi-degraded document images.
Findings
Achieves state-of-the-art SSIM scores of 0.7721 on DocUNet and 0.9480 on RealDAE.
Demonstrates excellent cross-domain adaptability and efficiency on high-resolution images.
Outperforms existing methods in real-world document image enhancement tasks.
Abstract
Document Image Enhancement (DIE) serves as a critical component in Document AI systems, where its performance substantially determines the effectiveness of downstream tasks. To address the limitations of existing methods confined to single-degradation restoration or grayscale image processing, we present Global with Local Parametric Generation Enhancement Network (GL-PGENet), a novel architecture designed for multi-degraded color document images, ensuring both efficiency and robustness in real-world scenarios. Our solution incorporates three key innovations: First, a hierarchical enhancement framework that integrates global appearance correction with local refinement, enabling coarse-to-fine quality improvement. Second, a Dual-Branch Local-Refine Network with parametric generation mechanisms that replaces conventional direct prediction, producing enhanced outputs through learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Generative Adversarial Networks and Image Synthesis · Handwritten Text Recognition Techniques
