Towards Multimodal Sentiment Analysis via Contrastive Cross-modal Retrieval Augmentation and Hierachical Prompts
Xianbing Zhao, Shengzun Yang, Buzhou Tang, Ronghuan Jiang

TL;DR
This paper introduces a novel multimodal sentiment analysis framework that uses contrastive cross-modal retrieval and hierarchical prompts to incorporate both inter-sample and intra-sample context, significantly improving performance.
Contribution
The work proposes a retrieval-augmented multimodal sentiment analysis model that uniquely combines inter-sample and intra-sample reference contexts using contrastive retrieval and hierarchical prompts.
Findings
Outperforms existing methods on two public datasets.
Effectively captures both inter-sample and intra-sample information.
Demonstrates significant improvement in sentiment classification accuracy.
Abstract
Multimodal sentiment analysis is a fundamental problem in the field of affective computing. Although significant progress has been made in cross-modal interaction, it remains a challenge due to the insufficient reference context in cross-modal interactions. Current cross-modal approaches primarily focus on leveraging modality-level reference context within a individual sample for cross-modal feature enhancement, neglecting the potential cross-sample relationships that can serve as sample-level reference context to enhance the cross-modal features. To address this issue, we propose a novel multimodal retrieval-augmented framework to simultaneously incorporate inter-sample modality-level reference context and cross-sample sample-level reference context to enhance the multimodal features. In particular, we first design a contrastive cross-modal retrieval module to retrieve semantic similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Emotion and Mood Recognition · Multimodal Machine Learning Applications
