TL;DR
This paper introduces a new large-scale dataset and a multimodal CNN-based method for appearance-based gaze estimation in natural, uncontrolled environments, demonstrating significant improvements over existing methods.
Contribution
The work provides the MPIIGaze dataset collected in real-world conditions and proposes a novel CNN approach that outperforms state-of-the-art methods in the wild.
Findings
MPIIGaze dataset with 213,659 images from real-world use
Proposed CNN method significantly outperforms existing methods in cross-dataset tests
Identified key challenges for gaze estimation in natural settings
Abstract
Appearance-based gaze estimation is believed to work well in real-world settings, but existing datasets have been collected under controlled laboratory conditions and methods have been not evaluated across multiple datasets. In this work we study appearance-based gaze estimation in the wild. We present the MPIIGaze dataset that contains 213,659 images we collected from 15 participants during natural everyday laptop use over more than three months. Our dataset is significantly more variable than existing ones with respect to appearance and illumination. We also present a method for in-the-wild appearance-based gaze estimation using multimodal convolutional neural networks that significantly outperforms state-of-the art methods in the most challenging cross-dataset evaluation. We present an extensive evaluation of several state-of-the-art image-based gaze estimation algorithms on three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
