LDRNet: Enabling Real-time Document Localization on Mobile Devices
Han Wu, Holland Qian, Huaming Wu, Aad van Moorsel

TL;DR
LDRNet is a lightweight, real-time document localization model for mobile devices that significantly improves speed while maintaining accuracy, enabling more efficient ID verification processes without human intervention.
Contribution
The paper introduces LDRNet, a novel lightweight model with unique prediction branches and loss functions, optimized for real-time document localization on mobile devices.
Findings
LDRNet achieves up to 790 FPS, 47 times faster than existing methods.
LDRNet maintains comparable accuracy with other approaches in document localization.
The model is suitable for various mobile applications requiring fast document detection.
Abstract
While Identity Document Verification (IDV) technology on mobile devices becomes ubiquitous in modern business operations, the risk of identity theft and fraud is increasing. The identity document holder is normally required to participate in an online video interview to circumvent impostors. However, the current IDV process depends on an additional human workforce to support online step-by-step guidance which is inefficient and expensive. The performance of existing AI-based approaches cannot meet the real-time and lightweight demands of mobile devices. In this paper, we address those challenges by designing an edge intelligence-assisted approach for real-time IDV. Aiming at improving the responsiveness of the IDV process, we propose a new document localization model for mobile devices, LDRNet, to Localize the identity Document in Real-time. On the basis of a lightweight backbone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics · Advanced Malware Detection Techniques · Web Data Mining and Analysis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
