TL;DR
This paper introduces the Bi-Bimodal Fusion Network (BBFN), an end-to-end model that dynamically balances relevance and independence among modalities for improved multimodal sentiment analysis, outperforming state-of-the-art methods.
Contribution
The paper proposes a novel end-to-end bimodal fusion network with a relevance-separation mechanism and gated Transformer control for better multimodal sentiment analysis.
Findings
BBFN significantly outperforms SOTA on three datasets
The relevance-separation mechanism improves fusion quality
Gated Transformer enhances final sentiment prediction accuracy
Abstract
Multimodal sentiment analysis aims to extract and integrate semantic information collected from multiple modalities to recognize the expressed emotions and sentiment in multimodal data. This research area's major concern lies in developing an extraordinary fusion scheme that can extract and integrate key information from various modalities. However, one issue that may restrict previous work to achieve a higher level is the lack of proper modeling for the dynamics of the competition between the independence and relevance among modalities, which could deteriorate fusion outcomes by causing the collapse of modality-specific feature space or introducing extra noise. To mitigate this, we propose the Bi-Bimodal Fusion Network (BBFN), a novel end-to-end network that performs fusion (relevance increment) and separation (difference increment) on pairwise modality representations. The two parts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Layer Normalization · Byte Pair Encoding · Label Smoothing · Residual Connection · Dense Connections
