Advanced Hough-based method for on-device document localization
D.V. Tropin, A.M. Ershov, D.P. Nikolaev, V.V. Arlazarov

TL;DR
This paper introduces an advanced Hough-based algorithm for on-device document localization that balances computational efficiency with high precision, outperforming existing methods on challenging datasets.
Contribution
It presents a novel Hough-based approach that incorporates geometric invariants and combines edge and color features, optimized for resource-constrained devices.
Findings
Achieved second-best precision on SmartDoc dataset
Guaranteed best precision on MIDV-500 dataset
Retained suitability for on-device implementation
Abstract
The demand for on-device document recognition systems increases in conjunction with the emergence of more strict privacy and security requirements. In such systems, there is no data transfer from the end device to a third-party information processing servers. The response time is vital to the user experience of on-device document recognition. Combined with the unavailability of discrete GPUs, powerful CPUs, or a large RAM capacity on consumer-grade end devices such as smartphones, the time limitations put significant constraints on the computational complexity of the applied algorithms for on-device execution. In this work, we consider document location in an image without prior knowledge of the document content or its internal structure. In accordance with the published works, at least 5 systems offer solutions for on-device document location. All these systems use a location method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image and Object Detection Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · Convolution · U-Net
