GazeProphet: Software-Only Gaze Prediction for VR Foveated Rendering
Farhaan Ebadulla, Chiraag Mudlapur, Gaurav BV

TL;DR
GazeProphet is a novel software-only gaze prediction method for VR foveated rendering that combines vision transformers and LSTMs, achieving high accuracy without hardware eye tracking.
Contribution
It introduces a multi-modal neural network that predicts gaze locations in VR without requiring dedicated eye tracking hardware, enabling more accessible foveated rendering.
Findings
Median angular error of 3.83 degrees
Outperforms saliency-based baselines by 24%
Reliable confidence calibration across scenes
Abstract
Foveated rendering significantly reduces computational demands in virtual reality applications by concentrating rendering quality where users focus their gaze. Current approaches require expensive hardware-based eye tracking systems, limiting widespread adoption due to cost, calibration complexity, and hardware compatibility constraints. This paper presents GazeProphet, a software-only approach for predicting gaze locations in VR environments without requiring dedicated eye tracking hardware. The approach combines a Spherical Vision Transformer for processing 360-degree VR scenes with an LSTM-based temporal encoder that captures gaze sequence patterns. A multi-modal fusion network integrates spatial scene features with temporal gaze dynamics to predict future gaze locations with associated confidence estimates. Experimental evaluation on a comprehensive VR dataset demonstrates that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Virtual Reality Applications and Impacts · Hand Gesture Recognition Systems
