Vision-based Engagement Detection in Virtual Reality
Ghassem Tofighi, Kaamraan Raahemifar, Maria Frank, Haisong Gu

TL;DR
This paper introduces DAIA, a multi-modal engagement detection model for VR that uses biometric data and a finite state transducer to accurately identify user engagement states, improving gesture recognition robustness.
Contribution
The paper presents a novel multi-modal engagement detection framework using FST for VR, enabling automatic mental status recognition beyond focus gestures.
Findings
FST achieved a 92.3% true detection rate across four engagement states.
The framework effectively segments user hand gestures.
Multi-modal data integration enhances engagement detection robustness.
Abstract
User engagement modeling for manipulating actions in vision-based interfaces is one of the most important case studies of user mental state detection. In a Virtual Reality environment that employs camera sensors to recognize human activities, we have to know when user intends to perform an action and when not. Without a proper algorithm for recognizing engagement status, any kind of activities could be interpreted as manipulating actions, called "Midas Touch" problem. Baseline approach for solving this problem is activating gesture recognition system using some focus gestures such as waiving or raising hand. However, a desirable natural user interface should be able to understand user's mental status automatically. In this paper, a novel multi-modal model for engagement detection, DAIA, is presented. using DAIA, the spectrum of mental status for performing an action is quantized in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Gaze Tracking and Assistive Technology · Human Pose and Action Recognition
