LPM: Learnable Pooling Module for Efficient Full-Face Gaze Estimation
Reo Ogusu, Takao Yamanaka

TL;DR
This paper introduces a learnable pooling module that compresses full-face images for gaze estimation, maintaining accuracy while reducing computational load by adaptively focusing on important regions.
Contribution
The novel learnable pooling module enables end-to-end training to optimize image compression for efficient and accurate full-face gaze estimation.
Findings
Maintains gaze estimation accuracy with reduced image size
Reduces computational cost through adaptive pooling
Demonstrates effectiveness on benchmark datasets
Abstract
Gaze tracking is an important technology in many domains. Techniques such as Convolutional Neural Networks (CNN) has allowed the invention of gaze tracking method that relies only on commodity hardware such as the camera on a personal computer. It has been shown that the full-face region for gaze estimation can provide better performance than from an eye image alone. However, a problem with using the full-face image is the heavy computation due to the larger image size. This study tackles this problem through compression of the input full-face image by removing redundant information using a novel learnable pooling module. The module can be trained end-to-end by backpropagation to learn the size of the grid in the pooling filter. The learnable pooling module keeps the resolution of valuable regions high and vice versa. This proposed method preserved the gaze estimation accuracy at a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Advanced Computing and Algorithms · Retinal Imaging and Analysis
