Towards Self-Supervised Gaze Estimation
Arya Farkhondeh, Cristina Palmero, Simone Scardapane, Sergio Escalera

TL;DR
This paper introduces SwAT, an equivariant self-supervised learning method based on SwAV, which improves gaze estimation accuracy by learning more informative representations from unlabeled face images, outperforming existing methods.
Contribution
The paper proposes SwAT, a novel equivariant self-supervised approach for gaze estimation that leverages geometric transformations to enhance representation learning.
Findings
SwAT outperforms state-of-the-art gaze estimation methods.
Achieves up to 57% improvement in cross-dataset evaluation.
Achieves up to 25% improvement in within-dataset evaluation.
Abstract
Recent joint embedding-based self-supervised methods have surpassed standard supervised approaches on various image recognition tasks such as image classification. These self-supervised methods aim at maximizing agreement between features extracted from two differently transformed views of the same image, which results in learning an invariant representation with respect to appearance and geometric image transformations. However, the effectiveness of these approaches remains unclear in the context of gaze estimation, a structured regression task that requires equivariance under geometric transformations (e.g., rotations, horizontal flip). In this work, we propose SwAT, an equivariant version of the online clustering-based self-supervised approach SwAV, to learn more informative representations for gaze estimation. We demonstrate that SwAT, with ResNet-50 and supported with uncurated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Advanced Computing and Algorithms · Brain Tumor Detection and Classification
MethodsLARS · Swapping Assignments between Views
