FAPNet: An Effective Frequency Adaptive Point-based Eye Tracker
Xiaopeng Lin, Hongwei Ren, Bojun Cheng

TL;DR
FAPNet introduces a frequency adaptive point-based eye tracker utilizing event cameras, achieving high accuracy and efficiency by leveraging sparse, high-temporal-resolution data and novel architectural mechanisms for resource-limited environments.
Contribution
The paper proposes FAPNet, a novel frequency adaptive point-based architecture for event-based eye tracking that improves accuracy and reduces computational load on edge devices.
Findings
Achieves 97.95% accuracy on the Event-based Eye Tracking Challenge.
Consumes only 10% of PEPNet's computational resources on the SEET dataset.
Independent of sensor resolution, suitable for resource-constrained devices.
Abstract
Eye tracking is crucial for human-computer interaction in different domains. Conventional cameras encounter challenges such as power consumption and image quality during different eye movements, prompting the need for advanced solutions with ultra-fast, low-power, and accurate eye trackers. Event cameras, fundamentally designed to capture information about moving objects, exhibit low power consumption and high temporal resolution. This positions them as an alternative to traditional cameras in the realm of eye tracking. Nevertheless, existing event-based eye tracking networks neglect the pivotal sparse and fine-grained temporal information in events, resulting in unsatisfactory performance. Moreover, the energy-efficient features are further compromised by the use of excessively complex models, hindering efficient deployment on edge devices. In this paper, we utilize Point Cloud as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Robotics and Sensor-Based Localization · Retinal Imaging and Analysis
MethodsSigmoid Activation · Tanh Activation · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Long Short-Term Memory
