MAGE: A Multi-task Architecture for Gaze Estimation with an Efficient Calibration Module
Haoming Huang, Musen Zhang, Jianxin Yang, Zhen Li, Jinkai Li, Yao Guo

TL;DR
MAGE is a multi-task neural network architecture that estimates 6-DoF gaze in 3D space, incorporating an efficient calibration module to adapt to individual differences, advancing gaze analysis in human-robot interaction.
Contribution
The paper introduces MAGE, a novel multi-task model with a calibration module for accurate 6-DoF gaze estimation applicable in real-world HRI scenarios.
Findings
Achieves state-of-the-art performance on multiple datasets.
Effectively adapts to individual eye variations.
Provides comprehensive 3D gaze information.
Abstract
Eye gaze can provide rich information on human psychological activities, and has garnered significant attention in the field of Human-Robot Interaction (HRI). However, existing gaze estimation methods merely predict either the gaze direction or the Point-of-Gaze (PoG) on the screen, failing to provide sufficient information for a comprehensive six Degree-of-Freedom (DoF) gaze analysis in 3D space. Moreover, the variations of eye shape and structure among individuals also impede the generalization capability of these methods. In this study, we propose MAGE, a Multi-task Architecture for Gaze Estimation with an efficient calibration module, to predict the 6-DoF gaze information that is applicable for the real-word HRI. Our basic model encodes both the directional and positional features from facial images, and predicts gaze results with dedicated information flow and multiple decoders. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Social Robot Interaction and HRI
MethodsSoftmax · Attention Is All You Need
