SentiMM: A Multimodal Multi-Agent Framework for Sentiment Analysis in Social Media

Xilai Xu; Zilin Zhao; Chengye Song; Zining Wang; Jinhe Qiang; Jiongrui Yan; Yuhuai Lin

arXiv:2508.18108·cs.CL·August 26, 2025

SentiMM: A Multimodal Multi-Agent Framework for Sentiment Analysis in Social Media

Xilai Xu, Zilin Zhao, Chengye Song, Zining Wang, Jinhe Qiang, Jiongrui Yan, Yuhuai Lin

PDF

TL;DR

SentiMM is a novel multimodal multi-agent framework that enhances social media sentiment analysis by effectively processing heterogeneous data, integrating external knowledge, and achieving superior classification performance.

Contribution

It introduces a multi-agent system for multimodal sentiment analysis and a large-scale dataset with fine-grained sentiment categories.

Findings

01

SentiMM outperforms existing methods on benchmark datasets.

02

The framework effectively fuses text and visual data.

03

Knowledge integration improves sentiment recognition accuracy.

Abstract

With the increasing prevalence of multimodal content on social media, sentiment analysis faces significant challenges in effectively processing heterogeneous data and recognizing multi-label emotions. Existing methods often lack effective cross-modal fusion and external knowledge integration. We propose SentiMM, a novel multi-agent framework designed to systematically address these challenges. SentiMM processes text and visual inputs through specialized agents, fuses multimodal features, enriches context via knowledge retrieval, and aggregates results for final sentiment classification. We also introduce SentiMMD, a large-scale multimodal dataset with seven fine-grained sentiment categories. Extensive experiments demonstrate that SentiMM achieves superior performance compared to state-of-the-art baselines, validating the effectiveness of our structured approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.