Robust Dynamic Facial Expression Recognition
Feng Liu, Hanyang Wang, Siyuan Shen

TL;DR
This paper introduces RDFER, a robust method for dynamic facial expression recognition that effectively distinguishes hard and noisy samples, improving accuracy on benchmark datasets through novel sampling and hierarchical modeling techniques.
Contribution
It proposes a new framework combining key expression re-sampling and dual-stream hierarchical networks to enhance robustness and representation in dynamic facial expression recognition.
Findings
Outperforms state-of-the-art methods on DFEW and FERV39K datasets.
Effectively distinguishes between hard and noisy samples.
Provides insights into agreement-based sample evaluation.
Abstract
The study of Dynamic Facial Expression Recognition (DFER) is a nascent field of research that involves the automated recognition of facial expressions in video data. Although existing research has primarily focused on learning representations under noisy and hard samples, the issue of the coexistence of both types of samples remains unresolved. In order to overcome this challenge, this paper proposes a robust method of distinguishing between hard and noisy samples. This is achieved by evaluating the prediction agreement of the model on different sampled clips of the video. Subsequently, methodologies that reinforce the learning of hard samples and mitigate the impact of noisy samples can be employed. Moreover, to identify the principal expression in a video and enhance the model's capacity for representation learning, comprising a key expression re-sampling framework and a dual-stream…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
