WebFace260M: A Benchmark for Million-Scale Deep Face Recognition
Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze, Chen, Jiagang Zhu, Tian Yang, Dalong Du, Jiwen Lu, Jie Zhou

TL;DR
This paper introduces WebFace260M, a large-scale, uncurated face recognition benchmark with 260 million faces, along with a cleaned subset, to advance research in high-performance, scalable face recognition systems.
Contribution
It presents the largest public face recognition training set, a scalable cleaning pipeline, and comprehensive evaluation protocols for standard, masked, and unbiased face recognition scenarios.
Findings
Reduced 40% failure rate on IJB-C dataset
Achieved 3rd place among 430 entries on NIST-FRVT
Demonstrated superior performance with only 10% data (WebFace4M)
Abstract
Face benchmarks empower the research community to train and evaluate high-performance face recognition systems. In this paper, we contribute a new million-scale recognition benchmark, containing uncurated 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol. Firstly, we collect 4M name lists and download 260M faces from the Internet. Then, a Cleaning Automatically utilizing Self-Training (CAST) pipeline is devised to purify the tremendous WebFace260M, which is efficient and scalable. To the best of our knowledge, the cleaned WebFace42M is the largest public face recognition training set and we expect to close the data gap between academia and industry. Referring to practical deployments, Face Recognition Under Inference Time conStraint (FRUITS) protocol and a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Face and Expression Recognition
