Benchmarking Fast Data Platforms for the Aadhaar Biometric Database
Yogesh Simmhan, Anshu Shukla, Arun Verma

TL;DR
This paper benchmarks fast data processing platforms for Aadhaar, the world's largest biometric database, focusing on handling high-volume, high-velocity biometric data streams for social services in India.
Contribution
It provides a comprehensive evaluation of data platforms' performance in managing Aadhaar's large-scale biometric data streams.
Findings
Identifies the most efficient platforms for Aadhaar's data processing needs.
Highlights challenges in scaling biometric data streams.
Offers insights into optimizing big data architectures for biometric applications.
Abstract
Aadhaar is the world's largest biometric database with a billion records, being compiled as an identity platform to deliver social services to residents of India.Aadhaar processes streams of biometric data as residents are enrolled and updated.Besides 1 million enrolments and updates per day,up to 100 million daily biometric authentications are expected during delivery of various public services.These form critical Big Data applications,with large volumes and high velocity of data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
