Learning a Metric Embedding for Face Recognition using the Multibatch Method
Oren Tadmor, Yonatan Wexler, Tal Rosenwein, Shai, Shalev-Shwartz, Amnon Shashua

TL;DR
This paper introduces the Multibatch training method for face recognition, significantly improving training efficiency and accuracy on embedded systems by reducing gradient variance.
Contribution
The novel Multibatch method for similarity learning reduces gradient variance and accelerates convergence, enabling high-accuracy face recognition on minimal hardware.
Findings
Achieved 98.2% accuracy on LFW benchmark
Training time of 12 hours on a Titan X GPU
Prediction runtime of 30ms on ARM Cortex A9
Abstract
This work is motivated by the engineering task of achieving a near state-of-the-art face recognition on a minimal computing budget running on an embedded system. Our main technical contribution centers around a novel training method, called Multibatch, for similarity learning, i.e., for the task of generating an invariant "face signature" through training pairs of "same" and "not-same" face images. The Multibatch method first generates signatures for a mini-batch of face images and then constructs an unbiased estimate of the full gradient by relying on all pairs from the mini-batch. We prove that the variance of the Multibatch estimator is bounded by , under some mild conditions. In contrast, the standard gradient estimator that relies on random pairs has a variance of order . The smaller variance of the Multibatch estimator significantly speeds up the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
