Neural Aggregation Network for Video Face Recognition

Jiaolong Yang; Peiran Ren; Dongqing Zhang; Dong Chen; Fang Wen,; Hongdong Li; Gang Hua

arXiv:1603.05474·cs.CV·August 3, 2017·45 cites

Neural Aggregation Network for Video Face Recognition

Jiaolong Yang, Peiran Ren, Dongqing Zhang, Dong Chen, Fang Wen,, Hongdong Li, Gang Hua

PDF

Open Access

TL;DR

This paper introduces a Neural Aggregation Network that effectively combines variable face images from videos into a fixed feature for recognition, improving accuracy without extra supervision.

Contribution

The proposed NAN uses attention-based aggregation to automatically emphasize high-quality face images, achieving state-of-the-art video face recognition performance.

Findings

01

Outperforms naive aggregation methods on benchmarks

02

Automatically learns to prioritize high-quality images

03

Achieves state-of-the-art accuracy on multiple datasets

Abstract

This paper presents a Neural Aggregation Network (NAN) for video face recognition. The network takes a face video or face image set of a person with a variable number of face images as its input, and produces a compact, fixed-dimension feature representation for recognition. The whole network is composed of two modules. The feature embedding module is a deep Convolutional Neural Network (CNN) which maps each face image to a feature vector. The aggregation module consists of two attention blocks which adaptively aggregate the feature vectors to form a single feature inside the convex hull spanned by them. Due to the attention mechanism, the aggregation is invariant to the image order. Our NAN is trained with a standard classification or verification loss without any extra supervision signal, and we found that it automatically learns to advocate high-quality face images while repelling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Face and Expression Recognition · Video Surveillance and Tracking Methods