Using Vision Language Models to Detect Students' Academic Emotion through Facial Expressions
Deliang Wang, Chao Yang, Gaowei Chen

TL;DR
This paper explores the use of vision-language models to detect students' academic emotions from facial expressions in online learning, demonstrating moderate success and highlighting potential for practical emotion recognition applications.
Contribution
It investigates the application of zero-shot vision-language models for academic emotion detection, showing their capabilities and limitations without requiring fine-tuning.
Findings
Models perform moderately in recognizing facial expressions.
Qwen2.5-VL-7B-Instruct outperforms Llama-3.2-11B-Vision-Instruct.
Models excel at identifying happy emotions but struggle with distracted behavior.
Abstract
Students' academic emotions significantly influence their social behavior and learning performance. Traditional approaches to automatically and accurately analyze these emotions have predominantly relied on supervised machine learning algorithms. However, these models often struggle to generalize across different contexts, necessitating repeated cycles of data collection, annotation, and training. The emergence of Vision-Language Models (VLMs) offers a promising alternative, enabling generalization across visual recognition tasks through zero-shot prompting without requiring fine-tuning. This study investigates the potential of VLMs to analyze students' academic emotions via facial expressions in an online learning environment. We employed two VLMs, Llama-3.2-11B-Vision-Instruct and Qwen2.5-VL-7B-Instruct, to analyze 5,000 images depicting confused, distracted, happy, neutral, and tired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Intelligent Tutoring Systems and Adaptive Learning · Online Learning and Analytics
