Co-designing a Sub-millisecond Latency Event-based Eye Tracking System with Submanifold Sparse CNN
Baoheng Zhang, Yizhao Gao, Jingyuan Li, Hayden Kwok-Hay So

TL;DR
This paper presents a co-designed event-based eye-tracking system using a sparse CNN accelerator on FPGA, achieving sub-millisecond latency, high accuracy, and low power consumption for VR/AR applications.
Contribution
It introduces a novel hardware/software co-design leveraging sparse CNNs and FPGA acceleration for ultra-low latency eye tracking.
Findings
Achieves 0.7 ms latency and 2.29 mJ per inference.
Attains 81% p5 accuracy and 99.5% p10 accuracy on the dataset.
Demonstrates effective processing of sparse event data with specialized FPGA architecture.
Abstract
Eye-tracking technology is integral to numerous consumer electronics applications, particularly in the realm of virtual and augmented reality (VR/AR). These applications demand solutions that excel in three crucial aspects: low-latency, low-power consumption, and precision. Yet, achieving optimal performance across all these fronts presents a formidable challenge, necessitating a balance between sophisticated algorithms and efficient backend hardware implementations. In this study, we tackle this challenge through a synergistic software/hardware co-design of the system with an event camera. Leveraging the inherent sparsity of event-based input data, we integrate a novel sparse FPGA dataflow accelerator customized for submanifold sparse convolution neural networks (SCNN). The SCNN implemented on the accelerator can efficiently extract the embedding feature vector from each representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Imaging Technologies · Gaze Tracking and Assistive Technology
MethodsConvolution
