MATT: Multimodal Attention Level Estimation for e-learning Platforms
Roberto Daza, Luis F. Gomez, Aythami Morales, Julian Fierrez, Ruben, Tolosana, Ruth Cobos, Javier Ortega-Garcia

TL;DR
This paper introduces a multimodal system leveraging facial and behavioral cues with CNNs to estimate student attention levels during online learning, demonstrating improved accuracy through score-level fusion on a public dataset.
Contribution
The study develops a novel multimodal attention estimation system combining CNN-based modules and fusion techniques for e-learning environments, validated on a new public dataset.
Findings
Multimodal fusion improves attention estimation accuracy.
CNN-based modules effectively extract behavioral and physiological features.
System validated on the mEBAL dataset with promising results.
Abstract
This work presents a new multimodal system for remote attention level estimation based on multimodal face analysis. Our multimodal approach uses different parameters and signals obtained from the behavior and physiological processes that have been related to modeling cognitive load such as faces gestures (e.g., blink rate, facial actions units) and user actions (e.g., head pose, distance to the camera). The multimodal system uses the following modules based on Convolutional Neural Networks (CNNs): Eye blink detection, head pose estimation, facial landmark detection, and facial expression features. First, we individually evaluate the proposed modules in the task of estimating the student's attention level captured during online e-learning sessions. For that we trained binary classifiers (high or low attention) based on Support Vector Machines (SVM) for each module. Secondly, we find out…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology
