WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
Zheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze, Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou

TL;DR
This paper introduces WebFace260M, a large-scale face recognition benchmark with a cleaned subset WebFace42M, along with evaluation protocols, enabling significant advancements in million-scale face recognition research.
Contribution
It presents the largest public face recognition training set, a scalable cleaning pipeline, and comprehensive benchmarks for evaluating face matchers under practical constraints.
Findings
Reduced 40% failure rate on IJB-C with WebFace42M
Achieved 3rd place on NIST-FRVT leaderboard
Demonstrated superior performance with only 10% of data
Abstract
In this paper, we contribute a new million-scale face benchmark containing noisy 4M identities/260M faces (WebFace260M) and cleaned 2M identities/42M faces (WebFace42M) training data, as well as an elaborately designed time-constrained evaluation protocol. Firstly, we collect 4M name list and download 260M faces from the Internet. Then, a Cleaning Automatically utilizing Self-Training (CAST) pipeline is devised to purify the tremendous WebFace260M, which is efficient and scalable. To the best of our knowledge, the cleaned WebFace42M is the largest public face recognition training set and we expect to close the data gap between academia and industry. Referring to practical scenarios, Face Recognition Under Inference Time conStraint (FRUITS) protocol and a test set are constructed to comprehensively evaluate face matchers. Equipped with this benchmark, we delve into million-scale face…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Face and Expression Recognition
MethodsDepthwise Convolution · Pointwise Convolution · Softmax · Depthwise Separable Convolution · Max Pooling · Average Pooling · Residual Block · Grouped Convolution · Sigmoid Activation · Global Average Pooling
