NLML-HPE: Head Pose Estimation with Limited Data via Manifold Learning
Mahdi Ghafourian, Federico M. Sukno

TL;DR
This paper introduces NLML-HPE, a deep learning method that uses manifold learning and tensor decomposition to accurately estimate head pose angles from limited data, outperforming traditional approaches.
Contribution
The paper presents a novel approach combining tensor decomposition and neural networks for head pose estimation as a regression problem with limited data.
Findings
Achieved real-time head pose estimation with limited training data.
Generated a precise 2D head pose dataset from 3D models.
Effectively modeled pose angles as cosine curves on manifolds.
Abstract
Head pose estimation (HPE) plays a critical role in various computer vision applications such as human-computer interaction and facial recognition. In this paper, we propose a novel deep learning approach for head pose estimation with limited training data via non-linear manifold learning called NLML-HPE. This method is based on the combination of tensor decomposition (i.e., Tucker decomposition) and feed forward neural networks. Unlike traditional classification-based approaches, our method formulates head pose estimation as a regression problem, mapping input landmarks into a continuous representation of pose angles. To this end, our method uses tensor decomposition to split each Euler angle (yaw, pitch, roll) to separate subspaces and models each dimension of the underlying manifold as a cosine curve. We address two key challenges: 1. Almost all HPE datasets suffer from incorrect and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Face recognition and analysis · Hand Gesture Recognition Systems
