Appearance-based Gaze Estimation With Deep Learning: A Review and Benchmark
Yihua Cheng, Haofei Wang, Yiwei Bao, Feng Lu

TL;DR
This paper provides a comprehensive review and benchmark of appearance-based gaze estimation methods using deep learning, addressing challenges in fair comparison and offering guidelines for future research.
Contribution
It systematically surveys existing methods, summarizes data processing techniques, and establishes a benchmark with datasets and source code for deep learning-based gaze estimation.
Findings
Characterized public datasets for gaze estimation
Provided source code for typical algorithms
Established a comprehensive benchmark for future research
Abstract
Human gaze provides valuable information on human focus and intentions, making it a crucial area of research. Recently, deep learning has revolutionized appearance-based gaze estimation. However, due to the unique features of gaze estimation research, such as the unfair comparison between 2D gaze positions and 3D gaze vectors and the different pre-processing and post-processing methods, there is a lack of a definitive guideline for developing deep learning-based gaze estimation algorithms. In this paper, we present a systematic review of the appearance-based gaze estimation methods using deep learning. Firstly, we survey the existing gaze estimation algorithms along the typical gaze estimation pipeline: deep feature extraction, deep learning model design, personal calibration and platforms. Secondly, to fairly compare the performance of different approaches, we summarize the data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Advanced Computing and Algorithms · Hand Gesture Recognition Systems
