TL;DR
FaceNet introduces a deep learning system that maps face images into a Euclidean space for efficient face recognition, verification, and clustering, achieving state-of-the-art accuracy with compact embeddings.
Contribution
The paper presents a novel deep convolutional network trained with triplet loss and online triplet mining, producing highly efficient face embeddings for recognition tasks.
Findings
Achieves 99.63% accuracy on LFW dataset.
Reduces error rate by 30% compared to previous methods.
Uses only 128 bytes per face for state-of-the-art performance.
Abstract
Despite significant recent advances in the field of face recognition, implementing face verification and recognition efficiently at scale presents serious challenges to current approaches. In this paper we present a system, called FaceNet, that directly learns a mapping from face images to a compact Euclidean space where distances directly correspond to a measure of face similarity. Once this space has been produced, tasks such as face recognition, verification and clustering can be easily implemented using standard techniques with FaceNet embeddings as feature vectors. Our method uses a deep convolutional network trained to directly optimize the embedding itself, rather than an intermediate bottleneck layer as in previous deep learning approaches. To train, we use triplets of roughly aligned matching / non-matching face patches generated using a novel online triplet mining method.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods3D Convolution
