High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer
Mingxian Li, Hao Sun, Yingtie Lei, Xiaofeng Zhang, Yihang Dong, Yilin, Zhou, Zimeng Li, Xuhang Chen

TL;DR
This paper introduces StainDoc, a large-scale dataset for document stain removal, and proposes StainRestorer, a memory-augmented Transformer that effectively removes stains while preserving document details.
Contribution
The paper presents the first high-resolution dataset for document stain removal and a novel Transformer-based model with memory augmentation for improved stain removal performance.
Findings
StainRestorer outperforms existing methods on the StainDoc dataset.
The dataset includes over 5,000 paired stained and clean document images.
Memory-augmented Transformer effectively captures hierarchical stain features.
Abstract
Document images are often degraded by various stains, significantly impacting their readability and hindering downstream applications such as document digitization and analysis. The absence of a comprehensive stained document dataset has limited the effectiveness of existing document enhancement methods in removing stains while preserving fine-grained details. To address this challenge, we construct StainDoc, the first large-scale, high-resolution () dataset specifically designed for document stain removal. StainDoc comprises over 5,000 pairs of stained and clean document images across multiple scenes. This dataset encompasses a diverse range of stain types, severities, and document backgrounds, facilitating robust training and evaluation of document stain removal algorithms. Furthermore, we propose StainRestorer, a Transformer-based document stain removal approach.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Industrial Vision Systems and Defect Detection
MethodsLinear Layer · Dense Connections · Label Smoothing · Layer Normalization · Residual Connection · Byte Pair Encoding · Absolute Position Encodings · Attention Is All You Need · Multi-Head Attention · Softmax
