Deep Architectures for Face Attributes
Tobi Baumgartner, Jack Culpepper

TL;DR
This paper develops a deep convolutional neural network trained on a new dataset of public figures to classify identity and facial attributes like age, gender, ethnicity, and emotion, optimizing resource sharing and fine-tuning strategies.
Contribution
It introduces a new dataset, explores optimal resource sharing within the network, and compares different fine-tuning depths for attribute prediction.
Findings
Fine-tuning from earlier layers improves age and emotion prediction.
Only 1 G flops are needed for all predictions.
Optimal sharing pattern reduces computational cost.
Abstract
We train a deep convolutional neural network to perform identity classification using a new dataset of public figures annotated with age, gender, ethnicity and emotion labels, and then fine-tune it for attribute classification. An optimal sharing pattern of computational resources within this network is determined by experiment, requiring only 1 G flops to produce all predictions. Rather than fine-tune by relearning weights in one additional layer after the penultimate layer of the identity network, we try several different depths for each attribute. We find that prediction of age and emotion is improved by fine-tuning from earlier layers onward, presumably because deeper layers are progressively invariant to non-identity related changes in the input.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition
