VAD-Net: Multidimensional Facial Expression Recognition in Intelligent Education System
Yi Huo, Yun Ge

TL;DR
This paper introduces VAD annotation for facial expression recognition datasets, proposes an orthogonal convolution network for better VAD prediction, and provides a new benchmark dataset for multidimensional emotion analysis.
Contribution
It provides the first D dimension annotation on FER datasets and develops an orthogonal convolution network to improve VAD emotion prediction accuracy.
Findings
D dimension can be measured but is more challenging than V and A.
Orthogonal convolution improves VAD prediction accuracy.
The new dataset and code are publicly available.
Abstract
Current FER (Facial Expression Recognition) dataset is mostly labeled by emotion categories, such as happy, angry, sad, fear, disgust, surprise, and neutral which are limited in expressiveness. However, future affective computing requires more comprehensive and precise emotion metrics which could be measured by VAD(Valence-Arousal-Dominance) multidimension parameters. To address this, AffectNet has tried to add VA (Valence and Arousal) information, but still lacks D(Dominance). Thus, the research introduces VAD annotation on FER2013 dataset, takes the initiative to label D(Dominance) dimension. Then, to further improve network capacity, it enforces orthogonalized convolution on it, which extracts more diverse and expressive features and will finally increase the prediction accuracy. Experiment results show that D dimension could be measured but is difficult to obtain compared with V and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Educational Technology and Pedagogy
