Resource-Limited Joint Multimodal Sentiment Reasoning and Classification via Chain-of-Thought Enhancement and Distillation

Haonan Shangguan; Xiaocui Yang; Shi Feng; Daling Wang; Yifei Zhang; and Ge Yu

arXiv:2508.05234·cs.CL·August 8, 2025

Resource-Limited Joint Multimodal Sentiment Reasoning and Classification via Chain-of-Thought Enhancement and Distillation

Haonan Shangguan, Xiaocui Yang, Shi Feng, Daling Wang, Yifei Zhang, and Ge Yu

PDF

TL;DR

This paper introduces MulCoT-RD, a lightweight model for resource-limited multimodal sentiment reasoning and classification, using a distillation approach from a high-performance teacher model to enable efficient and interpretable sentiment analysis.

Contribution

The paper proposes a novel Multimodal Chain-of-Thought Reasoning Distillation framework for resource-constrained environments, enabling effective sentiment reasoning and classification with a small model.

Findings

01

MulCoT-RD achieves strong performance with only 3B parameters.

02

The model demonstrates robust generalization across datasets.

03

Enhanced interpretability of multimodal sentiment reasoning.

Abstract

The surge in rich multimodal content on social media platforms has greatly advanced Multimodal Sentiment Analysis (MSA), with Large Language Models (LLMs) further accelerating progress in this field. Current approaches primarily leverage the knowledge and reasoning capabilities of parameter-heavy (Multimodal) LLMs for sentiment classification, overlooking autonomous multimodal sentiment reasoning generation in resource-constrained environments. Therefore, we focus on the Resource-Limited Joint Multimodal Sentiment Reasoning and Classification task, JMSRC, which simultaneously performs multimodal sentiment reasoning chain generation and sentiment classification only with a lightweight model. We propose a Multimodal Chain-of-Thought Reasoning Distillation model, MulCoT-RD, designed for JMSRC that employs a "Teacher-Assistant-Student" distillation paradigm to address deployment constraints…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.