Deep Pictorial Gaze Estimation

Seonwook Park; Adrian Spurr; Otmar Hilliges

arXiv:1807.10002·cs.CV·December 31, 2018

TL;DR

This paper presents a novel deep neural network architecture for gaze estimation from single eye images, using an intermediate pictorial representation to improve accuracy and robustness over state-of-the-art methods.

Contribution

Introduces a new deep neural network that regresses to a pictorial representation for more accurate 3D gaze estimation from single eye images.

Findings

01. Achieves higher accuracy than existing methods.

02. Robust to variations in gaze, head pose, and image quality.

03. Outperforms state-of-the-art in quantitative and qualitative evaluations.

Abstract

Estimating human gaze from natural eye images only is a challenging task. Gaze direction can be defined by the pupil- and the eyeball center where the latter is unobservable in 2D images. Hence, achieving highly accurate gaze estimates is an ill-posed problem. In this paper, we introduce a novel deep neural network architecture specifically designed for the task of gaze estimation from single eye input. Instead of directly regressing two angles for the pitch and yaw of the eyeball, we regress to an intermediate pictorial representation which in turn simplifies the task of 3D gaze direction estimation. Our quantitative and qualitative results show that our approach achieves higher accuracies than the state-of-the-art and is robust to variation in gaze, head pose and image quality.
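The abstract describes gaze as two angles (pitch and yaw) that determine a 3D gaze direction. As a minimal sketch of how such angles relate to a direction vector, and how accuracy between a predicted and a true gaze is commonly measured as an angle, consider the following. The function names and the sign convention (a gaze of zero pitch and zero yaw points along -z, toward the camera) are assumptions made for illustration; they are not taken from the paper or from the linked repository.

```python
import math


def pitchyaw_to_vector(pitch, yaw):
    """Convert pitch and yaw (radians) to a unit 3D gaze direction.

    Assumed convention (hypothetical, not from the paper): the camera
    looks along +z, so pitch = yaw = 0 gives a gaze along -z.
    """
    x = -math.cos(pitch) * math.sin(yaw)
    y = -math.sin(pitch)
    z = -math.cos(pitch) * math.cos(yaw)
    return (x, y, z)


def angular_error_deg(a, b):
    """Angle in degrees between two 3D direction vectors."""
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cos_sim = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    return math.degrees(math.acos(cos_sim))


# Example: a prediction that is off by 10 degrees in yaw only.
truth = pitchyaw_to_vector(0.0, 0.0)
pred = pitchyaw_to_vector(0.0, math.radians(10.0))
error = angular_error_deg(truth, pred)
```

Under this convention, `error` in the example is the 10 degree yaw offset. The same metric applies whether the angles are regressed directly or recovered from an intermediate representation.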

Code & Models

Repositories

xiamenwcy/pictorial_net (PyTorch)
