ExpLLM: Towards Chain of Thought for Facial Expression Recognition
Xing Lan, Jian Xue, Ji Qi, Dongmei Jiang, Ke Lu, Tat-Seng Chua

TL;DR
This paper introduces ExpLLM, a novel approach using large language models to generate a chain of thought for facial expression recognition, improving accuracy and interpretability over existing methods.
Contribution
The paper presents a new method called ExpLLM that leverages LLMs to produce detailed chains of thought for FER, incorporating AU interactions and emotional analysis.
Findings
ExpLLM outperforms state-of-the-art FER methods on RAF-DB and AffectNet datasets.
ExpLLM surpasses GPT-4o in expression CoT generation, especially for micro-expressions.
The approach enhances interpretability and accuracy in facial expression recognition.
Abstract
Facial expression recognition (FER) is a critical task in multimedia with significant implications across various domains. However, analyzing the causes of facial expressions is essential for accurately recognizing them. Current approaches, such as those based on facial action units (AUs), typically provide AU names and intensities but lack insight into the interactions and relationships between AUs and the overall expression. In this paper, we propose a novel method called ExpLLM, which leverages large language models to generate an accurate chain of thought (CoT) for facial expression recognition. Specifically, we have designed the CoT mechanism from three key perspectives: key observations, overall emotional interpretation, and conclusion. The key observations describe the AU's name, intensity, and associated emotions. The overall emotional interpretation provides an analysis based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Face and Expression Recognition
