Variable-state Latent Conditional Random Fields for Facial Expression Recognition and Action Unit Detection
Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic

TL;DR
This paper introduces the variable-state L-CRF model that adaptively selects between nominal and ordinal latent states for improved facial expression and action unit recognition, enhancing generalization over existing methods.
Contribution
The paper proposes the VSL-CRF model with a novel graph-Laplacian regularization to automatically choose optimal latent states, addressing limitations of fixed state assumptions in L-CRFs.
Findings
VSL-CRF outperforms traditional L-CRFs on public databases.
The model achieves better generalization in facial expression recognition.
Regularization reduces overfitting of latent states.
Abstract
Automated recognition of facial expressions of emotions, and detection of facial action units (AUs), from videos depends critically on modeling of their dynamics. These dynamics are characterized by changes in temporal phases (onset-apex-offset) and intensity of emotion expressions and AUs, the appearance of which may vary considerably among target subjects, making the recognition/detection task very challenging. The state-of-the-art Latent Conditional Random Fields (L-CRF) framework allows one to efficiently encode these dynamics through the latent states accounting for the temporal consistency in emotion expression and ordinal relationships between its intensity levels, these latent states are typically assumed to be either unordered (nominal) or fully ordered (ordinal). Yet, such an approach is often too restrictive. For instance, in the case of AU detection, the goal is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face and Expression Recognition · Human Pose and Action Recognition
