Learning Shared Sentiment Prototypes for Adaptive Multimodal Sentiment Analysis
Chen Su, Yuanhe Tian, Yan Song

TL;DR
PRISM is a novel framework for multimodal sentiment analysis that organizes evidence in a shared prototype space and dynamically reweights modalities during reasoning, improving performance on benchmark datasets.
Contribution
It introduces a structured shared prototype space for multimodal evidence and a dynamic reweighting mechanism for adaptive modality fusion during sentiment reasoning.
Findings
PRISM outperforms baseline models on three benchmark datasets.
Shared prototype space enables better cross-modal comparison.
Dynamic reweighting refines modality contributions during reasoning.
Abstract
Multimodal sentiment analysis (MSA) aims to predict human sentiment from textual, acoustic, and visual information in videos. Recent studies improve multimodal fusion by modeling modality interaction and assigning different modality weights. However, they usually compress diverse sentiment cues into a single compact representation before sentiment reasoning. This early aggregation makes it difficult to preserve the internal structure of sentiment evidence, where different cues may complement, conflict with, or differ in reliability from each other. In addition, modality importance is often determined only once during fusion, so later reasoning cannot further adjust modality contributions. To address these issues, we propose PRISM, a framework that unifies structured affective extraction and adaptive modality evaluation. PRISM organizes multimodal evidence in a shared prototype space,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
