Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation
Ross Greer, Alisha Ukani, Katherine Izhikevich, Earlence Fernandes, Stefan Savage, Alex C. Snoeren

TL;DR
This paper presents a novel OCR-based method for estimating homographies in document alignment, enabling effective registration using only textual features and spatial positions, especially useful when image data is unavailable.
Contribution
The paper introduces a new approach that uses OCR outputs as geometric features for homography estimation, bypassing the need for raw image data.
Findings
OCR-based method outperforms traditional image-based approaches in accuracy.
The approach is robust to OCR noise and outliers.
It offers a scalable solution for document registration when images are inaccessible.
Abstract
Document alignment and registration play a crucial role in numerous real-world applications, such as automated form processing, anomaly detection, and workflow automation. Traditional methods for document alignment rely on image-based features like keypoints, edges, and textures to estimate geometric transformations, such as homographies. However, these approaches often require access to the original document images, which may not always be available due to privacy, storage, or transmission constraints. This paper introduces a novel approach that leverages Optical Character Recognition (OCR) outputs as features for homography estimation. By utilizing the spatial positions and textual content of OCR-detected words, our method enables document alignment without relying on pixel-level image data. This technique is particularly valuable in scenarios where only OCR outputs are accessible.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques
MethodsSparse Evolutionary Training
