HAGI++: Head-Assisted Gaze Imputation and Generation
Chuhan Jiao, Zhiming Hu, Andreas Bulling

TL;DR
HAGI++ is a novel multi-modal diffusion-based method that leverages head orientation sensors and transformer models to improve gaze data imputation and generation, outperforming existing methods in accuracy and realism.
Contribution
The paper introduces HAGI++, the first approach to incorporate head orientation sensors into gaze imputation using a transformer-based diffusion model, enhancing accuracy and enabling gaze generation with wearable device data.
Findings
HAGI++ outperforms traditional interpolation and deep learning baselines in gaze imputation.
Gaze velocity distributions from HAGI++ closely match actual human gaze behavior.
Incorporating wrist motion data allows for effective gaze generation even with complete missing data.
Abstract
Mobile eye tracking plays a vital role in capturing human visual attention across both real-world and extended reality (XR) environments, making it an essential tool for applications ranging from behavioural research to human-computer interaction. However, missing values due to blinks, pupil detection errors, or illumination changes pose significant challenges for further gaze data analysis. To address this challenge, we introduce HAGI++ - a multi-modal diffusion-based approach for gaze data imputation that, for the first time, uses the integrated head orientation sensors to exploit the inherent correlation between head and eye movements. HAGI++ employs a transformer-based diffusion model to learn cross-modal dependencies between eye and head representations and can be readily extended to incorporate additional body movements. Extensive evaluations on the large-scale Nymeria, Ego-Exo4D,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Mind wandering and attention
