RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild

Rafael Berral-Soler; Francisco J. Madrid-Cuevas; Rafael Muñoz-Salinas; Manuel J. Marín-Jiménez

arXiv:2011.01890·cs.CV·November 4, 2020

TL;DR

This paper introduces RealHePoNet, a single-stage ConvNet that accurately and efficiently estimates head pose angles in real-world images without facial landmarks, suitable for practical applications.

Contribution

The work presents a robust, fast ConvNet model trained on combined datasets for real-world head pose estimation without landmarks, achieving low error and inference time.

Findings

1. Average error of ~4.4° on test data
2. Inference time of ~6 ms per image
3. Effective on low-resolution grayscale images
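
The page does not say how the reported average error is aggregated. As a rough illustration only, the sketch below assumes it is a mean absolute error in degrees, computed separately for tilt and pan and then averaged over the two angles. The function name and the sample values are hypothetical and are not taken from the paper or its repository.

```python
def mean_absolute_errors(predicted, actual):
    """Return (tilt_mae, pan_mae, overall_mae) in degrees.

    Both arguments are sequences of (tilt, pan) pairs in degrees.
    Averaging the two per-angle errors is an assumption, not the
    paper's documented metric.
    """
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(predicted)
    tilt_mae = sum(abs(p[0] - a[0]) for p, a in zip(predicted, actual)) / n
    pan_mae = sum(abs(p[1] - a[1]) for p, a in zip(predicted, actual)) / n
    return tilt_mae, pan_mae, (tilt_mae + pan_mae) / 2


# Made-up predictions and labels, purely for illustration.
predicted = [(10.0, -30.0), (0.0, 15.0)]
actual = [(12.0, -25.0), (-4.0, 15.0)]
tilt_mae, pan_mae, overall_mae = mean_absolute_errors(predicted, actual)
```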

Abstract

Human head pose estimation in images has applications in many fields such as human-computer interaction or video surveillance tasks. In this work, we address this problem, defined here as the estimation of both vertical (tilt/pitch) and horizontal (pan/yaw) angles, through the use of a single Convolutional Neural Network (ConvNet) model, trying to balance precision and inference speed in order to maximize its usability in real-world applications. Our model is trained over the combination of two datasets: 'Pointing'04' (aiming at covering a wide range of poses) and 'Annotated Facial Landmarks in the Wild' (in order to improve robustness of our model for its use on real-world images). Three different partitions of the combined dataset are defined and used for training, validation and testing purposes. As a result of this work, we have obtained a trained ConvNet model, coined RealHePoNet,…
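
The abstract mentions combining two datasets and defining three partitions, but it does not give the split sizes or the procedure. The following is a minimal sketch of one common approach, a seeded shuffle followed by slicing. The 10% validation and test fractions, the function name, and the placeholder sample identifiers are all assumptions made for illustration, not the authors' setup.

```python
import random


def split_dataset(samples, val_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle samples reproducibly and return (train, val, test) lists.

    The default fractions are hypothetical; the abstract does not state
    the actual partition sizes.
    """
    items = list(samples)
    random.Random(seed).shuffle(items)  # fixed seed keeps the split repeatable
    n_test = int(len(items) * test_fraction)
    n_val = int(len(items) * val_fraction)
    test = items[:n_test]
    val = items[n_test:n_test + n_val]
    train = items[n_test + n_val:]
    return train, val, test


# Placeholder identifiers standing in for images from the two datasets.
pointing04 = [f"pointing04_{i}" for i in range(50)]
aflw = [f"aflw_{i}" for i in range(50)]
train, val, test = split_dataset(pointing04 + aflw)
```

Splitting after the two sources are merged means each partition can contain samples from both datasets, although a plain shuffle does not guarantee equal proportions.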

Code & Models

Repositories

rafabs97/headpose_final (TensorFlow, official)
