A Real-Time Multi-Task Learning System for Joint Detection of Face, Facial Landmark and Head Pose
Qingtian Wu, Liming Zhang

TL;DR
This paper presents a real-time multi-task system based on YOLOv8 that jointly detects faces, facial landmarks, and head poses, effectively handling large-angle face poses with high efficiency.
Contribution
It introduces a multi-task detection framework extending YOLOv8 with additional landmark regression, optimized for real-time joint face, landmark, and head pose detection.
Findings
Effective large-angle face pose handling
Real-time performance across tasks
Validated on 300W-LP and AFLW2000-3D datasets
Abstract
Extreme head postures pose a common challenge across a spectrum of facial analysis tasks, including face detection, facial landmark detection (FLD), and head pose estimation (HPE). These tasks are interdependent, where accurate FLD relies on robust face detection, and HPE is intricately associated with these key points. This paper focuses on the integration of these tasks, particularly when addressing the complexities posed by large-angle face poses. The primary contribution of this study is the proposal of a real-time multi-task detection system capable of simultaneously performing joint detection of faces, facial landmarks, and head poses. This system builds upon the widely adopted YOLOv8 detection framework. It extends the original object detection head by incorporating additional landmark regression head, enabling efficient localization of crucial facial landmarks. Furthermore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis
MethodsYou Only Look Once
