Real-time Memory Efficient Large-pose Face Alignment via Deep   Evolutionary Network

Bin Sun; Ming Shao; Siyu Xia; Yun Fu

arXiv:1910.11818·cs.CV·November 4, 2019·6 cites

Real-time Memory Efficient Large-pose Face Alignment via Deep Evolutionary Network

Bin Sun, Ming Shao, Siyu Xia, Yun Fu

PDF

Open Access

TL;DR

This paper introduces a memory-efficient, real-time face alignment method capable of handling large pose variations by combining 3D diffusion heap maps with a deep evolutionary network, significantly improving speed and accuracy.

Contribution

The authors propose a novel deep evolutionary model with 3D DHM and an efficient network structure, enabling fast, accurate face alignment under extreme poses.

Findings

01

Model is 6 times faster than state-of-the-art on CPU

02

Model is 14 times faster than state-of-the-art on GPU

03

Outperforms existing methods especially in large-pose scenarios

Abstract

There is an urgent need to apply face alignment in a memory-efficient and real-time manner due to the recent explosion of face recognition applications. However, impact factors such as large pose variation and computational inefficiency, still hinder its broad implementation. To this end, we propose a computationally efficient deep evolutionary model integrated with 3D Diffusion Heap Maps (DHM). First, we introduce a sparse 3D DHM to assist the initial modeling process under extreme pose conditions. Afterward, a simple and effective CNN feature is extracted and fed to Recurrent Neural Network (RNN) for evolutionary learning. To accelerate the model, we propose an efficient network structure to accelerate the evolutionary learning process through a factorization strategy. Extensive experiments on three popular alignment databases demonstrate the advantage of the proposed models over the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Video Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings