Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation

Ryosuke Kawamura; Hideaki Hayashi; Shunsuke Otake; Noriko Takemura; Hajime Nagahara

arXiv:2506.20867·cs.CV·June 27, 2025

Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation

Ryosuke Kawamura, Hideaki Hayashi, Shunsuke Otake, Noriko Takemura, Hajime Nagahara

PDF

Open Access

TL;DR

This paper introduces MIDAS, a data augmentation technique that uses soft labels and convex combinations of video frames to improve ambiguous dynamic facial expression recognition, achieving superior results on benchmark datasets.

Contribution

The paper proposes MIDAS, a novel soft label-based data augmentation method extending mixup for dynamic facial expression recognition with ambiguous data.

Findings

01

MIDAS improves recognition accuracy on DFEW and FERV39k-Plus datasets.

02

Models trained with MIDAS outperform state-of-the-art methods.

03

Soft label augmentation effectively handles ambiguity in facial expression data.

Abstract

Dynamic facial expression recognition (DFER) is a task that estimates emotions from facial expression video sequences. For practical applications, accurately recognizing ambiguous facial expressions -- frequently encountered in in-the-wild data -- is essential. In this study, we propose MIDAS, a data augmentation method designed to enhance DFER performance for ambiguous facial expression data using soft labels representing probabilities of multiple emotion classes. MIDAS augments training data by convexly combining pairs of video frames and their corresponding emotion class labels. This approach extends mixup to soft-labeled video data, offering a simple yet highly effective method for handling ambiguity in DFER. To evaluate MIDAS, we conducted experiments on both the DFEW dataset and FERV39k-Plus, a newly constructed dataset that assigns soft labels to an existing DFER dataset. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Advanced Computing and Algorithms · Face and Expression Recognition

MethodsMixup