Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment   Analysis

Teng Sun; Wenjie Wang; Liqiang Jing; Yiran Cui; Xuemeng Song; Liqiang; Nie

arXiv:2207.11652·cs.CL·July 26, 2022

Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis

Teng Sun, Wenjie Wang, Liqiang Jing, Yiran Cui, Xuemeng Song, Liqiang, Nie

PDF

1 Repo

TL;DR

This paper introduces a causal inference-based counterfactual framework to improve out-of-distribution multimodal sentiment analysis by mitigating the influence of spurious textual correlations, enhancing model generalization.

Contribution

It proposes a novel counterfactual approach that separates direct and indirect effects of textual modality, addressing spurious correlations for better OOD sentiment analysis.

Findings

01

Outperforms existing methods in OOD sentiment tasks

02

Improves generalization across diverse datasets

03

Effectively isolates textual bias in multimodal models

Abstract

Existing studies on multimodal sentiment analysis heavily rely on textual modality and unavoidably induce the spurious correlations between textual words and sentiment labels. This greatly hinders the model generalization ability. To address this problem, we define the task of out-of-distribution (OOD) multimodal sentiment analysis. This task aims to estimate and mitigate the bad effect of textual modality for strong OOD generalization. To this end, we embrace causal inference, which inspects the causal relationships via a causal graph. From the graph, we find that the spurious correlations are attributed to the direct effect of textual modality on the model prediction while the indirect one is more reliable by considering multimodal semantics. Inspired by this, we devise a model-agnostic counterfactual framework for multimodal sentiment analysis, which captures the direct effect of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

teng-sun/clue_model
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.