GaTector+: A Unified Head-free Framework for Gaze Object and Gaze Following Prediction
Yang Jin, Guangyu Guo, Binglu Wang

TL;DR
GaTector+ is a unified, head-free framework that improves gaze object detection and gaze following by integrating shared features, head detection, and attention mechanisms, eliminating the need for head priors during inference.
Contribution
This paper introduces GaTector+, a novel unified model that jointly performs gaze object detection and gaze following without relying on head-related priors, using a shared backbone and attention mechanisms.
Findings
Outperforms previous methods on multiple benchmarks.
Effectively predicts gaze objects and following points without head priors.
Introduces a new evaluation metric, mSoC, for gaze object detection.
Abstract
Gaze object detection and gaze following are fundamental tasks for interpreting human gaze behavior or intent. However, most previous methods usually solve these two tasks separately, and their prediction of gaze objects and gaze following typically depend on head-related prior knowledge during both the training phase and real-world deployment. This dependency necessitates an auxiliary network to extract head location, thus precluding joint optimization across the entire system and constraining the practical applicability. To this end, we propose GaTector+, a unified framework for gaze object detection and gaze following, which eliminates the dependence on the head-related priors during inference. Specifically, GaTector+ uses an expanded specific-general-specific feature extractor that leverages a shared backbone, which extracts general features for gaze following and object detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
