Multimodal Representations Learning Based on Mutual Information   Maximization and Minimization and Identity Embedding for Multimodal Sentiment   Analysis

Jiahao Zheng; Sen Zhang; Xiaoping Wang; Zhigang Zeng

arXiv:2201.03969·cs.LG·July 5, 2022·5 cites

Multimodal Representations Learning Based on Mutual Information Maximization and Minimization and Identity Embedding for Multimodal Sentiment Analysis

Jiahao Zheng, Sen Zhang, Xiaoping Wang, Zhigang Zeng

PDF

Open Access

TL;DR

This paper introduces MMMIE, a novel multimodal representation model that uses mutual information techniques and identity embedding to improve sentiment analysis by addressing heterogeneity and contextual modeling challenges.

Contribution

The paper proposes a new multimodal representation approach combining mutual information maximization and minimization with identity embedding for enhanced sentiment analysis.

Findings

01

Effective in bridging heterogeneity gap between modalities

02

Improves modeling of contextual dynamics

03

Demonstrates superior performance on public datasets

Abstract

Multimodal sentiment analysis (MSA) is a fundamental complex research problem due to the heterogeneity gap between different modalities and the ambiguity of human emotional expression. Although there have been many successful attempts to construct multimodal representations for MSA, there are still two challenges to be addressed: 1) A more robust multimodal representation needs to be constructed to bridge the heterogeneity gap and cope with the complex multimodal interactions, and 2) the contextual dynamics must be modeled effectively throughout the information flow. In this work, we propose a multimodal representation model based on Mutual information Maximization and Minimization and Identity Embedding (MMMIE). We combine mutual information maximization between modal pairs, and mutual information minimization between input data and corresponding features to mine the modal-invariant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies