Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
Yangzhou Jiang, Yinxin Lin, Yaoming Wang, Teng Li, Bilian Ke, Bingbing, Ni

TL;DR
This paper introduces an unsupervised gaze representation learning framework that leverages eye masking and contrastive learning to improve gaze estimation accuracy without requiring gaze annotations.
Contribution
It proposes a novel self-supervised training scheme with eye masking and contrastive loss to extract gaze features from full-face images, reducing annotation dependence.
Findings
Outperforms existing unsupervised gaze estimation methods on multiple benchmarks.
Effectively captures gaze-related features without gaze annotations.
Enhances robustness and accuracy in gaze estimation tasks.
Abstract
Appearance-based supervised methods with full-face image input have made tremendous advances in recent gaze estimation tasks. However, intensive human annotation requirement inhibits current methods from achieving industrial level accuracy and robustness. Although current unsupervised pre-training frameworks have achieved success in many image recognition tasks, due to the deep coupling between facial and eye features, such frameworks are still deficient in extracting useful gaze features from full-face. To alleviate above limitations, this work proposes a novel unsupervised/self-supervised gaze pre-training framework, which forces the full-face branch to learn a low dimensional gaze embedding without gaze annotations, through collaborative feature contrast and squeeze modules. In the heart of this framework is an alternating eye-attended/unattended masking training scheme, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Hand Gesture Recognition Systems · Brain Tumor Detection and Classification
MethodsSoftmax · Attention Is All You Need · Focus
