Learning a Metric Embedding for Face Recognition using the Multibatch   Method

Oren Tadmor; Yonatan Wexler; Tal Rosenwein; Shai; Shalev-Shwartz; Amnon Shashua

arXiv:1605.07270·cs.CV·May 25, 2016·20 cites

Learning a Metric Embedding for Face Recognition using the Multibatch Method

Oren Tadmor, Yonatan Wexler, Tal Rosenwein, Shai, Shalev-Shwartz, Amnon Shashua

PDF

Open Access

TL;DR

This paper introduces the Multibatch training method for face recognition, significantly improving training efficiency and accuracy on embedded systems by reducing gradient variance.

Contribution

The novel Multibatch method for similarity learning reduces gradient variance and accelerates convergence, enabling high-accuracy face recognition on minimal hardware.

Findings

01

Achieved 98.2% accuracy on LFW benchmark

02

Training time of 12 hours on a Titan X GPU

03

Prediction runtime of 30ms on ARM Cortex A9

Abstract

This work is motivated by the engineering task of achieving a near state-of-the-art face recognition on a minimal computing budget running on an embedded system. Our main technical contribution centers around a novel training method, called Multibatch, for similarity learning, i.e., for the task of generating an invariant "face signature" through training pairs of "same" and "not-same" face images. The Multibatch method first generates signatures for a mini-batch of $k$ face images and then constructs an unbiased estimate of the full gradient by relying on all $k^{2} - k$ pairs from the mini-batch. We prove that the variance of the Multibatch estimator is bounded by $O (1/ k^{2})$ , under some mild conditions. In contrast, the standard gradient estimator that relies on random $k /2$ pairs has a variance of order $1/ k$ . The smaller variance of the Multibatch estimator significantly speeds up the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Face recognition and analysis