GazeProphetV2: Head-Movement-Based Gaze Prediction Enabling Efficient Foveated Rendering on Mobile VR
Farhaan Ebadulla, Chiraag Mudlpaur, Shreya Chaurasia, Gaurav BV

TL;DR
GazeProphetV2 introduces a multimodal gaze prediction method combining gaze history, head movement, and scene data with attention mechanisms, significantly improving accuracy for VR rendering and interaction without expensive eye tracking.
Contribution
The paper presents a novel multimodal approach with cross-modal attention for gaze prediction in VR, enhancing accuracy and generalization over previous single-modality methods.
Findings
Improved gaze prediction accuracy across multiple VR scenes.
High temporal consistency in predicted gaze trajectories.
Effective generalization to unseen scenes with 93.1% validation accuracy.
Abstract
Predicting gaze behavior in virtual reality environments remains a significant challenge with implications for rendering optimization and interface design. This paper introduces a multimodal approach to VR gaze prediction that combines temporal gaze patterns, head movement data, and visual scene information. By leveraging a gated fusion mechanism with cross-modal attention, the approach learns to adaptively weight gaze history, head movement, and scene content based on contextual relevance. Evaluations using a dataset spanning 22 VR scenes with 5.3M gaze samples demonstrate improvements in predictive accuracy when combining modalities compared to using individual data streams alone. The results indicate that integrating past gaze trajectories with head orientation and scene content enhances prediction accuracy across 1-3 future frames. Cross-scene generalization testing shows consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Mind wandering and attention
