Gazing Into Missteps: Leveraging Eye-Gaze for Unsupervised Mistake Detection in Egocentric Videos of Skilled Human Activities
Michele Mazzamuto, Antonino Furnari, Yoichi Sato, Giovanni Maria Farinella

TL;DR
This paper introduces an unsupervised gaze-based method for mistake detection in egocentric videos of skilled activities, leveraging eye-gaze signals and a novel gaze completion task to identify deviations without labeled data.
Contribution
It proposes a new gaze completion approach and a mistake detection framework that does not require mistake annotations, outperforming supervised methods in various settings.
Findings
Achieves up to +14% accuracy gains in multiple datasets.
Effectively detects mistakes without labeled data.
Ranks first on the HoloAssist Mistake Detection challenge.
Abstract
We address the challenge of unsupervised mistake detection in egocentric video of skilled human activities through the analysis of gaze signals. While traditional methods rely on manually labeled mistakes, our approach does not require mistake annotations, hence overcoming the need of domain-specific labeled data. Based on the observation that eye movements closely follow object manipulation activities, we assess to what extent eye-gaze signals can support mistake detection, proposing to identify deviations in attention patterns measured through a gaze tracker with respect to those estimated by a gaze prediction model. Since predicting gaze in video is characterized by high uncertainty, we propose a novel gaze completion task, where eye fixations are predicted from visual observations and partial gaze trajectories, and contribute a novel gaze completion approach which explicitly models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Face recognition and analysis · Advanced Image Processing Techniques
