Revisiting Document Image Dewarping by Grid Regularization

Xiangwei Jiang; Rujiao Long; Nan Xue; Zhibo Yang; Cong Yao; Gui-Song; Xia

arXiv:2203.16850·eess.IV·April 6, 2022·1 cites

Revisiting Document Image Dewarping by Grid Regularization

Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song, Xia

PDF

Open Access

TL;DR

This paper proposes a novel grid regularization approach for document image dewarping that emphasizes readability by preserving text lines and boundaries through constrained optimization, outperforming prior methods.

Contribution

It introduces a new grid regularization scheme that leverages boundary and text line information for improved dewarping without relying solely on neural network optical flow estimation.

Findings

01

Outperforms prior arts in readability metrics

02

Maintains high image quality on DocUNet benchmark

03

Effective in preserving document structure during dewarping

Abstract

This paper addresses the problem of document image dewarping, which aims at eliminating the geometric distortion in document images for document digitization. Instead of designing a better neural network to approximate the optical flow fields between the inputs and outputs, we pursue the best readability by taking the text lines and the document boundaries into account from a constrained optimization perspective. Specifically, our proposed method first learns the boundary points and the pixels in the text lines and then follows the most simple observation that the boundaries and text lines in both horizontal and vertical directions should be kept after dewarping to introduce a novel grid regularization scheme. To obtain the final forward mapping for dewarping, we solve an optimization problem with our proposed grid regularization. The experiments comprehensively demonstrate that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Digital Media Forensic Detection · Image and Video Stabilization