Attention-Set based Metric Learning for Video Face Recognition
Yibo Hu, Xiang Wu, Ran He

TL;DR
This paper introduces a novel Attention-Set based Metric Learning approach that improves video face recognition by effectively measuring and utilizing the correlations within image sets, leading to state-of-the-art results.
Contribution
The paper presents a new set-aware metric learning method with memory attention weighting, enhancing video face recognition performance.
Findings
Achieves state-of-the-art results on YouTubeFace, YouTube Celebrities, and Celebrity-1000 datasets.
Explicitly minimizes intra-set and maximizes inter-set distances.
Integrates seamlessly into CNNs for end-to-end training.
Abstract
Face recognition has made great progress with the development of deep learning. However, video face recognition (VFR) is still an ongoing task due to various illumination, low-resolution, pose variations and motion blur. Most existing CNN-based VFR methods only obtain a feature vector from a single image and simply aggregate the features in a video, which less consider the correlations of face images in one video. In this paper, we propose a novel Attention-Set based Metric Learning (ASML) method to measure the statistical characteristics of image sets. It is a promising and generalized extension of Maximum Mean Discrepancy with memory attention weighting. First, we define an effective distance metric on image sets, which explicitly minimizes the intra-set distance and maximizes the inter-set distance simultaneously. Second, inspired by Neural Turing Machine, a Memory Attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition · Biometric Identification and Security
MethodsSoftmax · Sigmoid Activation · Tanh Activation · Location-based Attention · Long Short-Term Memory · Content-based Attention · Neural Turing Machine
