Efficient Facial Representations for Age, Gender and Identity Recognition in Organizing Photo Albums using Multi-output CNN
Andrey V. Savchenko

TL;DR
This paper introduces a two-stage multi-output CNN approach using a modified MobileNet for efficient facial attribute recognition and clustering in photo albums, achieving competitive accuracy with lower computational cost.
Contribution
The paper presents a novel two-stage method combining a modified MobileNet for simultaneous face recognition, age, and gender prediction, with hierarchical clustering for organizing photo albums.
Findings
Competitive clustering quality with state-of-the-art neural networks.
More accurate video-based age and gender recognition than existing models.
Reduced computational cost compared to similar approaches.
Abstract
This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. We propose the two-stage approach, in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. We modified the MobileNet, which is preliminarily trained to perform face recognition, in order to additionally recognize age and gender. In the second stage of our approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The born year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. We experimentally demonstrated that our facial clustering quality is competitive with the state-of-the-art neural networks, though our implementation is much…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
