Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild
Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding

TL;DR
Marior is a novel document dewarping pipeline that iteratively removes margins and rectifies content in a coarse-to-fine manner, effectively handling diverse real-world document images for improved OCR and visual quality.
Contribution
This work introduces Marior, a complete pipeline with margin removal and iterative content rectification modules, addressing practical challenges in dewarping documents in the wild.
Findings
Achieves state-of-the-art results on public benchmarks.
Effectively handles documents with large margins or no margins.
Adaptive iteration improves dewarping quality progressively.
Abstract
Camera-captured document images usually suffer from perspective and geometric deformations. It is of great value to rectify them when considering poor visual aesthetics and the deteriorated performance of OCR systems. Recent learning-based methods intensively focus on the accurately cropped document image. However, this might not be sufficient for overcoming practical challenges, including document images either with large marginal regions or without margins. Due to this impracticality, users struggle to crop documents precisely when they encounter large marginal regions. Simultaneously, dewarping images without margins is still an insurmountable problem. To the best of our knowledge, there is still no complete and effective pipeline for rectifying document images in the wild. To address this issue, we propose a novel approach called Marior (Margin Removal and \Iterative Content…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
