Multimodal Engagement Analysis from Facial Videos in the Classroom

\"Omer S\"umer; Patricia Goldberg; Sidney D'Mello; Peter Gerjets,; Ulrich Trautwein; Enkelejda Kasneci

arXiv:2101.04215·cs.CV·October 28, 2024

Multimodal Engagement Analysis from Facial Videos in the Classroom

\"Omer S\"umer, Patricia Goldberg, Sidney D'Mello, Peter Gerjets,, Ulrich Trautwein, Enkelejda Kasneci

PDF

TL;DR

This study develops and evaluates computer vision methods to automatically assess student engagement from facial videos in classroom settings, demonstrating promising results and potential for educational research and teacher training.

Contribution

The paper introduces a multimodal approach combining facial expression and head pose analysis with deep learning classifiers for classroom engagement detection, extending prior work to real classroom environments.

Findings

01

Best classifiers achieved AUCs of .620 and .720 for Grades 8 and 12.

02

Score-level fusion improves engagement classification performance.

03

Personalized models with 60-second data improve accuracy by .084 AUC.

Abstract

Student engagement is a key construct for learning and teaching. While most of the literature explored the student engagement analysis on computer-based settings, this paper extends that focus to classroom instruction. To best examine student visual engagement in the classroom, we conducted a study utilizing the audiovisual recordings of classes at a secondary school over one and a half month's time, acquired continuous engagement labeling per student (N=15) in repeated sessions, and explored computer vision methods to classify engagement levels from faces in the classroom. We trained deep embeddings for attentional and emotional features, training Attention-Net for head pose estimation and Affect-Net for facial expression recognition. We additionally trained different engagement classifiers, consisting of Support Vector Machines, Random Forest, Multilayer Perceptron, and Long…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.