Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction
Kushagra Mahajan, Monika Sharma, Lovekesh Vig

TL;DR
This paper introduces a fast, accurate homography estimation method for scanned documents using character-based keypoints and OCR, improving alignment for information extraction tasks.
Contribution
The paper presents a novel character keypoint-based algorithm leveraging OCR for precise homography estimation in document images, addressing limitations of traditional feature-based methods.
Findings
High accuracy in aligning scanned documents
Effective in real-world insurance claim datasets
Outperforms traditional feature-based approaches
Abstract
Precise homography estimation between multiple images is a pre-requisite for many computer vision applications. One application that is particularly relevant in today's digital era is the alignment of scanned or camera-captured document images such as insurance claim forms for information extraction. Traditional learning based approaches perform poorly due to the absence of an appropriate gradient. Feature based keypoint extraction techniques for homography estimation in real scene images either detect an extremely large number of inconsistent keypoints due to sharp textual edges, or produce inaccurate keypoint correspondences due to variations in illumination and viewpoint differences between document images. In this paper, we propose a novel algorithm for aligning scanned or camera-captured document images using character based keypoints and a reference template. The algorithm is both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
