ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis
Yakun Yu, Mingjun Zhao, Shi-ang Qi, Feiran Sun, Baoxun Wang, Weidong, Guo, Xiaoli Wang, Lei Yang, Di Niu

TL;DR
ConKI introduces a contrastive knowledge injection method that enhances multimodal sentiment analysis by integrating domain-specific knowledge with general pretrained knowledge through hierarchical contrastive learning, leading to improved prediction accuracy.
Contribution
The paper proposes a novel contrastive knowledge injection framework with an adapter architecture for multimodal sentiment analysis, effectively combining domain-specific and general knowledge representations.
Findings
ConKI outperforms previous methods on three benchmark datasets.
Hierarchical contrastive learning improves representation quality.
Knowledge injection enhances sentiment prediction accuracy.
Abstract
Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker. Previous approaches concentrate on performing multimodal fusion and representation learning based on general knowledge obtained from pretrained models, which neglects the effect of domain-specific knowledge. In this paper, we propose Contrastive Knowledge Injection (ConKI) for multimodal sentiment analysis, where specific-knowledge representations for each modality can be learned together with general knowledge representations via knowledge injection based on an adapter architecture. In addition, ConKI uses a hierarchical contrastive learning procedure performed between knowledge types within every single modality, across modalities within each sample, and across samples to facilitate the effective learning of the proposed representations, hence improving multimodal sentiment predictions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Speech Recognition and Synthesis · Topic Modeling
MethodsContrastive Learning · Adapter
