Dynamic Pose-Robust Facial Expression Recognition by Multi-View Pairwise Conditional Random Forests
Arnaud Dapogny, K\'evin Bailly, S\'everine Dubuisson

TL;DR
This paper introduces a multi-view pairwise conditional random forest approach for robust facial expression recognition in videos, effectively handling pose variations and capturing spatio-temporal patterns to improve accuracy.
Contribution
It proposes a novel multi-view, pairwise conditional random forest method that incorporates head pose estimation for dynamic facial expression recognition.
Findings
Significant improvement over standard Random Forests.
Enhanced accuracy on multi-view video datasets.
Effective handling of pose variations in FER.
Abstract
Automatic facial expression classification (FER) from videos is a critical problem for the development of intelligent human-computer interaction systems. Still, it is a challenging problem that involves capturing high-dimensional spatio-temporal patterns describing the variation of one's appearance over time. Such representation undergoes great variability of the facial morphology and environmental factors as well as head pose variations. In this paper, we use Conditional Random Forests to capture low-level expression transition patterns. More specifically, heterogeneous derivative features (e.g. feature point movements or texture variations) are evaluated upon pairs of images. When testing on a video frame, pairs are created between this current frame and previous ones and predictions for each previous frame are used to draw trees from Pairwise Conditional Random Forests (PCRF) whose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
