FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning

Zhuozhao Hu; Kaishen Yuan; Xin Liu; Zitong Yu; Yuan Zong; Jingang Shi; Huanjing Yue; Jingyu Yang

arXiv:2505.13419·cs.CV·May 20, 2025

FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning

Zhuozhao Hu, Kaishen Yuan, Xin Liu, Zitong Yu, Yuan Zong, Jingang Shi, Huanjing Yue, Jingyu Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces FEALLM, a multimodal large language model designed to improve facial emotion analysis by leveraging a new dataset and benchmark, enabling better interpretability, reasoning, and generalization in emotion inference from facial cues.

Contribution

The paper presents a novel FEA instruction dataset, a new benchmark FEABench, and a specialized MLLM architecture that significantly enhances emotion analysis performance and reasoning capabilities.

Findings

01

Strong performance on FEABench benchmark

02

Effective zero-shot generalization on multiple datasets

03

Improved interpretability and reasoning in facial emotion analysis

Abstract

Facial Emotion Analysis (FEA) plays a crucial role in visual affective computing, aiming to infer a person's emotional state based on facial data. Scientifically, facial expressions (FEs) result from the coordinated movement of facial muscles, which can be decomposed into specific action units (AUs) that provide detailed emotional insights. However, traditional methods often struggle with limited interpretability, constrained generalization and reasoning abilities. Recently, Multimodal Large Language Models (MLLMs) have shown exceptional performance in various visual tasks, while they still face significant challenges in FEA due to the lack of specialized datasets and their inability to capture the intricate relationships between FEs and AUs. To address these issues, we introduce a novel FEA Instruction Dataset that provides accurate and aligned FE and AU descriptions and establishes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

953206211/feallm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Face recognition and analysis · Face Recognition and Perception