TL;DR
Semi-IIN is a semi-supervised learning network that effectively captures intra- and inter-modal interactions for multimodal sentiment analysis, reducing annotation costs and improving performance on public datasets.
Contribution
It introduces a novel semi-supervised framework with masked attention and gating mechanisms for dynamic interaction selection in multimodal sentiment analysis.
Findings
Achieves state-of-the-art results on MOSI and MOSEI datasets.
Effectively utilizes unlabeled data through self-training.
Demonstrates the importance of adaptive intra- and inter-modal interaction modeling.
Abstract
Despite multimodal sentiment analysis being a fertile research ground that merits further investigation, current approaches take up high annotation cost and suffer from label ambiguity, non-amicable to high-quality labeled data acquisition. Furthermore, choosing the right interactions is essential because the significance of intra- or inter-modal interactions can differ among various samples. To this end, we propose Semi-IIN, a Semi-supervised Intra-inter modal Interaction learning Network for multimodal sentiment analysis. Semi-IIN integrates masked attention and gating mechanisms, enabling effective dynamic selection after independently capturing intra- and inter-modal interactive information. Combined with the self-training approach, Semi-IIN fully utilizes the knowledge learned from unlabeled data. Experimental results on two public datasets, MOSI and MOSEI, demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
MethodsSoftmax · Attention Is All You Need
