Can You Read Me Now? Content Aware Rectification using Angle Supervision
Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman

TL;DR
This paper introduces CREASE, a content-aware document rectification method that uses angle supervision and curvature estimation to improve OCR accuracy on distorted documents, outperforming previous approaches.
Contribution
It presents the first learned rectification method leveraging document content and angle supervision, with a novel pixel-wise angle regression and curvature estimation.
Findings
Surpasses previous methods in OCR accuracy.
Reduces geometric distortion errors.
Improves visual similarity of rectified documents.
Abstract
The ubiquity of smartphone cameras has led to more and more documents being captured by cameras rather than scanned. Unlike flatbed scanners, photographed documents are often folded and crumpled, resulting in large local variance in text structure. The problem of document rectification is fundamental to the Optical Character Recognition (OCR) process on documents, and its ability to overcome geometric distortions significantly affects recognition accuracy. Despite the great progress in recent OCR systems, most still rely on a pre-process that ensures the text lines are straight and axis aligned. Recent works have tackled the problem of rectifying document images taken in-the-wild using various supervision signals and alignment means. However, they focused on global features that can be extracted from the document's boundaries, ignoring various signals that could be obtained from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
