Multistage Model for Robust Face Alignment Using Deep Neural Networks

Huabin Wang; Rui Cheng; Jian Zhou; Liang Tao; Hon Keung; Kwan

arXiv:2002.01075·cs.CV·February 5, 2020·1 cites

Multistage Model for Robust Face Alignment Using Deep Neural Networks

Huabin Wang, Rui Cheng, Jian Zhou, Liang Tao, Hon Keung, Kwan

PDF

Open Access

TL;DR

This paper introduces a multistage deep neural network model for robust face alignment under challenging conditions like occlusions and pose variations, combining spatial transformer networks, hourglass networks, and shape constraints.

Contribution

It presents a novel multistage framework integrating spatial transformer GANs, hourglass networks, and exemplar-based shape constraints for improved face alignment accuracy.

Findings

01

Outperforms state-of-the-art methods on benchmark datasets.

02

Effectively handles occlusions and pose variations.

03

Improves landmark localization accuracy.

Abstract

An ability to generalize unconstrained conditions such as severe occlusions and large pose variations remains a challenging goal to achieve in face alignment. In this paper, a multistage model based on deep neural networks is proposed which takes advantage of spatial transformer networks, hourglass networks and exemplar-based shape constraints. First, a spatial transformer - generative adversarial network which consists of convolutional layers and residual units is utilized to solve the initialization issues caused by face detectors, such as rotation and scale variations, to obtain improved face bounding boxes for face alignment. Then, stacked hourglass network is employed to obtain preliminary locations of landmarks as well as their corresponding scores. In addition, an exemplar-based shape dictionary is designed to determine landmarks with low scores based on those with high scores.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Face and Expression Recognition

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Spatial Transformer · 1x1 Convolution · Max Pooling · Convolution · Hourglass Module · Stacked Hourglass Network · Residual Connection