TL;DR
Gaze360 introduces a large-scale, unconstrained 3D gaze estimation dataset and a novel model that incorporates temporal data and gaze uncertainty, advancing robustness and cross-dataset generalization in gaze tracking.
Contribution
The paper presents Gaze360, the largest publicly available unconstrained gaze dataset, and a new 3D gaze model with temporal and uncertainty estimation capabilities.
Findings
Model outperforms existing methods in cross-dataset evaluation.
Inclusion of temporal information improves gaze estimation accuracy.
Self-supervised domain adaptation enhances cross-dataset generalization.
Abstract
Understanding where people are looking is an informative social cue. In this work, we present Gaze360, a large-scale gaze-tracking dataset and method for robust 3D gaze estimation in unconstrained images. Our dataset consists of 238 subjects in indoor and outdoor environments with labelled 3D gaze across a wide range of head poses and distances. It is the largest publicly available dataset of its kind by both subject and variety, made possible by a simple and efficient collection method. Our proposed 3D gaze model extends existing models to include temporal information and to directly output an estimate of gaze uncertainty. We demonstrate the benefits of our model via an ablation study, and show its generalization performance via a cross-dataset evaluation against other recent gaze benchmark datasets. We furthermore propose a simple self-supervised approach to improve cross-dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
