Unsupervised Gaze-aware Contrastive Learning with Subject-specific   Condition

Lingyu Du; Xucong Zhang; Guohao Lan

arXiv:2309.04506·cs.CV·September 12, 2023

Unsupervised Gaze-aware Contrastive Learning with Subject-specific Condition

Lingyu Du, Xucong Zhang, Guohao Lan

PDF

Open Access

TL;DR

ConGaze introduces an unsupervised contrastive learning framework for gaze estimation that uses subject-specific conditions and gaze-preserving augmentations to learn effective gaze representations from unlabeled facial images.

Contribution

It proposes a novel unsupervised contrastive learning method with a subject-conditional projection module and gaze-specific augmentation for improved gaze estimation.

Findings

01

Outperforms existing unsupervised methods by 6.7% to 22.5%.

02

Achieves 15.1% to 24.6% improvement over supervised models in cross-dataset tests.

03

Effectively learns gaze-aware features without large annotated datasets.

Abstract

Appearance-based gaze estimation has shown great promise in many applications by using a single general-purpose camera as the input device. However, its success is highly depending on the availability of large-scale well-annotated gaze datasets, which are sparse and expensive to collect. To alleviate this challenge we propose ConGaze, a contrastive learning-based framework that leverages unlabeled facial images to learn generic gaze-aware representations across subjects in an unsupervised way. Specifically, we introduce the gaze-specific data augmentation to preserve the gaze-semantic features and maintain the gaze consistency, which are proven to be crucial for effective contrastive gaze representation learning. Moreover, we devise a novel subject-conditional projection module that encourages a share feature extractor to learn gaze-aware and generic representations. Our experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaze Tracking and Assistive Technology · Nasal Surgery and Airway Studies · Face recognition and analysis