Multi-view Gaze Target Estimation

Qiaomu Miao; Vivek Raju Golani; Jingyi Xu; Progga Paromita Dutta; Minh Hoai; Dimitris Samaras

arXiv:2508.05857·cs.CV·August 11, 2025

Multi-view Gaze Target Estimation

Qiaomu Miao, Vivek Raju Golani, Jingyi Xu, Progga Paromita Dutta, Minh Hoai, Dimitris Samaras

PDF

Open Access

TL;DR

This paper introduces a multi-view gaze target estimation method that leverages multiple camera views to improve accuracy, handle occlusions, and estimate gaze targets even when the face is not visible, supported by a new dataset.

Contribution

The paper proposes a novel multi-view GTE approach with modules for head information aggregation, uncertainty-based gaze selection, and scene attention, along with a new dataset for evaluation.

Findings

01

Outperforms single-view baselines significantly

02

Enables gaze estimation even with occluded faces

03

Provides a new dataset for multi-view GTE

Abstract

This paper presents a method that utilizes multiple camera views for the gaze target estimation (GTE) task. The approach integrates information from different camera views to improve accuracy and expand applicability, addressing limitations in existing single-view methods that face challenges such as face occlusion, target ambiguity, and out-of-view targets. Our method processes a pair of camera views as input, incorporating a Head Information Aggregation (HIA) module for leveraging head information from both views for more accurate gaze estimation, an Uncertainty-based Gaze Selection (UGS) for identifying the most reliable gaze output, and an Epipolar-based Scene Attention (ESA) module for cross-view background information sharing. This approach significantly outperforms single-view baselines, especially when the second camera provides a clear view of the person's face. Additionally,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaze Tracking and Assistive Technology · Advanced Computing and Algorithms