TL;DR
This paper presents a novel deep neural network architecture for gaze estimation from single eye images, using an intermediate pictorial representation to improve accuracy and robustness over state-of-the-art methods.
Contribution
Introduces a new deep neural network that regresses to a pictorial representation for more accurate 3D gaze estimation from single eye images.
Findings
Achieves higher accuracy than existing methods.
Robust to variations in gaze, head pose, and image quality.
Outperforms state-of-the-art in quantitative and qualitative evaluations.
Abstract
Estimating human gaze from natural eye images only is a challenging task. Gaze direction can be defined by the pupil- and the eyeball center where the latter is unobservable in 2D images. Hence, achieving highly accurate gaze estimates is an ill-posed problem. In this paper, we introduce a novel deep neural network architecture specifically designed for the task of gaze estimation from single eye input. Instead of directly regressing two angles for the pitch and yaw of the eyeball, we regress to an intermediate pictorial representation which in turn simplifies the task of 3D gaze direction estimation. Our quantitative and qualitative results show that our approach achieves higher accuracies than the state-of-the-art and is robust to variation in gaze, head pose and image quality.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
