Learning Unsupervised Gaze Representation via Eye Mask Driven   Information Bottleneck

Yangzhou Jiang; Yinxin Lin; Yaoming Wang; Teng Li; Bilian Ke; Bingbing; Ni

arXiv:2407.00315·cs.CV·July 2, 2024

Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck

Yangzhou Jiang, Yinxin Lin, Yaoming Wang, Teng Li, Bilian Ke, Bingbing, Ni

PDF

Open Access

TL;DR

This paper introduces an unsupervised gaze representation learning framework that leverages eye masking and contrastive learning to improve gaze estimation accuracy without requiring gaze annotations.

Contribution

It proposes a novel self-supervised training scheme with eye masking and contrastive loss to extract gaze features from full-face images, reducing annotation dependence.

Findings

01

Outperforms existing unsupervised gaze estimation methods on multiple benchmarks.

02

Effectively captures gaze-related features without gaze annotations.

03

Enhances robustness and accuracy in gaze estimation tasks.

Abstract

Appearance-based supervised methods with full-face image input have made tremendous advances in recent gaze estimation tasks. However, intensive human annotation requirement inhibits current methods from achieving industrial level accuracy and robustness. Although current unsupervised pre-training frameworks have achieved success in many image recognition tasks, due to the deep coupling between facial and eye features, such frameworks are still deficient in extracting useful gaze features from full-face. To alleviate above limitations, this work proposes a novel unsupervised/self-supervised gaze pre-training framework, which forces the full-face branch to learn a low dimensional gaze embedding without gaze annotations, through collaborative feature contrast and squeeze modules. In the heart of this framework is an alternating eye-attended/unattended masking training scheme, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaze Tracking and Assistive Technology · Hand Gesture Recognition Systems · Brain Tumor Detection and Classification

MethodsSoftmax · Attention Is All You Need · Focus