A Gaze Estimation Method Based on Spatial and Channel Reconstructed ResNet Combined with Multi-Clue Fusion
Zhaoyu Shou, Yanjun Lin, Jianwen Mo, Ziyong Wu

TL;DR
This paper introduces a new gaze estimation model that improves accuracy by combining spatial and channel feature extraction with multi-clue fusion for online learning applications.
Contribution
The novel RSP-MCGaze model integrates ResNet and SCConv to better extract features and model spatio-temporal relationships in gaze estimation.
Findings
The model achieves a detection error of 9.86 on the Gaze360 dataset.
It achieves a detection error of 7.11 on the detectable face subset of Gaze360.
The model outperforms existing baseline models on public datasets.
Abstract
The complexity of various factors influencing online learning makes it difficult to characterize learning concentration, while Accurately estimating students’ gaze points during learning video sessions represents a critical scientific challenge in assessing and enhancing the attentiveness of online learners. However, current appearance-based gaze estimation models lack a focus on extracting essential features and fail to effectively model the spatio-temporal relationships among the head, face, and eye regions, which limits their ability to achieve lower angular errors. This paper proposes an appearance-based gaze estimation model (RSP-MCGaze). The model constructs a feature extraction backbone network for gaze estimation (ResNetSC) by integrating ResNet and SCConv; this integration enhances the model’s ability to extract important features while reducing spatial and channel redundancy.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Gaze Tracking and Assistive Technology · Advanced Computing and Algorithms
