Simultaneous face detection and 360 degree headpose estimation
Hoang Nguyen Viet, Linh Nguyen Viet, Tuan Nguyen Dinh, Duc Tran Minh,, Long Tran Quoc

TL;DR
This paper introduces Multitask-Net, a deep learning model that simultaneously detects faces and estimates 360-degree head poses by sharing features between tasks, improving accuracy over separate models.
Contribution
The novel Multitask-Net model integrates face detection and head pose estimation into a single multitask framework, leveraging shared features for enhanced performance.
Findings
Achieves accurate 360-degree head pose estimation.
Improves face detection and head pose accuracy through shared features.
Handles large Euler angle variations with rotation matrix representation.
Abstract
With many practical applications in human life, including manufacturing surveillance cameras, analyzing and processing customer behavior, many researchers are noticing face detection and head pose estimation on digital images. A large number of proposed deep learning models have state-of-the-art accuracy such as YOLO, SSD, MTCNN, solving the problem of face detection or HopeNet, FSA-Net, RankPose model used for head pose estimation problem. According to many state-of-the-art methods, the pipeline of this task consists of two parts, from face detection to head pose estimation. These two steps are completely independent and do not share information. This makes the model clear in setup but does not leverage most of the featured resources extracted in each model. In this paper, we proposed the Multitask-Net model with the motivation to leverage the features extracted from the face detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Face and Expression Recognition
MethodsYou Only Look Once · 1x1 Convolution · Convolution · Non Maximum Suppression · SSD
