GazeOnce360: Fisheye-Based 360{\deg} Multi-Person Gaze Estimation with Global-Local Feature Fusion
Zhuojiang Cai, Zhenghui Sun, Feng Lu

TL;DR
GazeOnce360 introduces an end-to-end fisheye-based model for 3D multi-person gaze estimation in 360-degree scenes, utilizing a new synthetic dataset and a dual-resolution architecture to handle distortion and capture eye details.
Contribution
The paper presents a novel approach for multi-person gaze estimation from fisheye images, including a large synthetic dataset and a dual-resolution model with feature fusion.
Findings
Effective handling of fisheye distortion through rotational convolutions.
Improved gaze estimation accuracy with dual-resolution feature fusion.
Demonstrated feasibility of 360-degree multi-person gaze estimation.
Abstract
We present GazeOnce360, a novel end-to-end model for multi-person gaze estimation from a single tabletop-mounted upward-facing fisheye camera. Unlike conventional approaches that rely on forward-facing cameras in constrained viewpoints, we address the underexplored setting of estimating the 3D gaze direction of multiple people distributed across a 360{\deg} scene from an upward fisheye perspective. To support research in this setting, we introduce MPSGaze360, a large-scale synthetic dataset rendered using Unreal Engine, featuring diverse multi-person configurations with accurate 3D gaze and eye landmark annotations. Our model tackles the severe distortion and perspective variation inherent in fisheye imagery by incorporating rotational convolutions and eye landmark supervision. To better capture fine-grained eye features crucial for gaze estimation, we propose a dual-resolution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology · Visual Attention and Saliency Detection · Social Robot Interaction and HRI
