Input Aggregated Network for Face Video Representation

Zhen Dong; Su Jia; Chi Zhang; Mingtao Pei

arXiv:1603.06655·cs.CV·March 23, 2016·1 cites

Input Aggregated Network for Face Video Representation

Zhen Dong, Su Jia, Chi Zhang, Mingtao Pei

PDF

Open Access

TL;DR

This paper introduces a novel input aggregated network architecture that learns fixed-length representations from variable-length face videos, improving face video recognition performance.

Contribution

The work proposes an end-to-end network with aggregation and mapping units to effectively represent face videos, a novel approach compared to traditional image-based recognition methods.

Findings

01

Effective face video representation demonstrated on public datasets

02

Outperforms existing methods in face video recognition tasks

03

End-to-end learning system successfully captures video-specific features

Abstract

Recently, deep neural network has shown promising performance in face image recognition. The inputs of most networks are face images, and there is hardly any work reported in literature on network with face videos as input. To sufficiently discover the useful information contained in face videos, we present a novel network architecture called input aggregated network which is able to learn fixed-length representations for variable-length face videos. To accomplish this goal, an aggregation unit is designed to model a face video with various frames as a point on a Riemannian manifold, and the mapping unit aims at mapping the point into high-dimensional space where face videos belonging to the same subject are close-by and others are distant. These two units together with the frame representation unit build an end-to-end learning system which can learn representations of face videos for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Face and Expression Recognition