Compound Expression Recognition via Multi Model Ensemble
Jun Yu, Jichao Zhu, Wangyuan Zhu

TL;DR
This paper introduces an ensemble learning approach combining convolutional networks, Vision Transformers, and local attention networks for improved compound expression recognition, achieving high accuracy and zero-shot capabilities.
Contribution
It proposes a novel ensemble method that integrates multiple models for more accurate and robust compound expression recognition.
Findings
High accuracy on RAF-DB dataset
Effective zero-shot recognition on C-EXPR-DB
Ensemble approach outperforms individual models
Abstract
Compound Expression Recognition (CER) plays a crucial role in interpersonal interactions. Due to the existence of Compound Expressions , human emotional expressions are complex, requiring consideration of both local and global facial expressions to make judgments. In this paper, to address this issue, we propose a solution based on ensemble learning methods for Compound Expression Recognition. Specifically, our task is classification, where we train three expression classification models based on convolutional networks, Vision Transformers, and multi-scale local attention networks. Then, through model ensemble using late fusion, we merge the outputs of multiple models to predict the final result. Our method achieves high accuracy on RAF-DB and is able to recognize expressions through zero-shot on certain portions of C-EXPR-DB.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Machine Learning in Bioinformatics
