Face Trees for Expression Recognition
Mojtaba Kolahdouzi, Alireza Sepas-Moghaddam, Ali Etemad

TL;DR
This paper introduces a novel end-to-end facial expression recognition architecture that learns optimal facial landmark trees and combines structural and texture information through a dual-stream attention mechanism, achieving state-of-the-art results.
Contribution
It presents a new tree-based landmark topology learning method combined with a dual-stream attention architecture for improved expression recognition.
Findings
Outperforms existing methods on AffectNet and FER2013 datasets.
Achieves new state-of-the-art accuracy in facial expression recognition.
Demonstrates the effectiveness of combining structural and texture features.
Abstract
We propose an end-to-end architecture for facial expression recognition. Our model learns an optimal tree topology for facial landmarks, whose traversal generates a sequence from which we obtain an embedding to feed a sequential learner. The proposed architecture incorporates two main streams, one focusing on landmark positions to learn the structure of the face, while the other focuses on patches around the landmarks to learn texture information. Each stream is followed by an attention mechanism and the outputs are fed to a two-stream fusion component to perform the final classification. We conduct extensive experiments on two large-scale publicly available facial expression datasets, AffectNet and FER2013, to evaluate the efficacy of our approach. Our method outperforms other solutions in the area and sets new state-of-the-art expression recognition rates on these datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face recognition and analysis · Face and Expression Recognition
