Vision-Language Pre-Training for Multimodal Aspect-Based Sentiment Analysis

Yan Ling; Jianfei Yu; Rui Xia

arXiv:2204.07955 · cs.CV · April 22, 2022 · 1 cites

TL;DR

This paper introduces a specialized vision-language pre-training framework for multimodal aspect-based sentiment analysis, improving fine-grained understanding and alignment across visual and textual data.

Contribution

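The paper proposes a unified multimodal encoder-decoder architecture, shared by all pre-training and downstream tasks, together with pre-training tasks designed specifically for MABSA. The abstract reports that the approach generally outperforms previous methods.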

Findings

01 Generally outperforms state-of-the-art approaches on three MABSA subtasks

02 Effective cross-modal alignment and fine-grained aspect identification

03 Task-specific pre-training tasks improve model performance and interpretability

Abstract

As an important task in sentiment analysis, Multimodal Aspect-Based Sentiment Analysis (MABSA) has attracted increasing attention in recent years. However, previous approaches either (i) use separately pre-trained visual and textual models, which ignore the cross-modal alignment or (ii) use vision-language models pre-trained with general pre-training tasks, which are inadequate to identify fine-grained aspects, opinions, and their alignments across modalities. To tackle these limitations, we propose a task-specific Vision-Language Pre-training framework for MABSA (VLP-MABSA), which is a unified multimodal encoder-decoder architecture for all the pre-training and downstream tasks. We further design three types of task-specific pre-training tasks from the language, vision, and multimodal modalities, respectively. Experimental results show that our approach generally outperforms the…
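
To make the task concrete, here is a minimal sketch of what a MABSA example might look like as data: a sentence paired with an image, annotated with aspect spans and a sentiment for each. Everything in it is an assumption for illustration and is not taken from the paper or its repository: the class names, the `POS`/`NEU`/`NEG` label set, the sample sentence, and the placeholder image path. It only illustrates the input/output shape of the task, not the VLP-MABSA model or its pre-training objectives.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical label set; the abstract does not specify one.
SENTIMENTS = ("POS", "NEU", "NEG")


@dataclass(frozen=True)
class AspectSentiment:
    """One annotated aspect: an inclusive token span plus a sentiment label."""
    start: int
    end: int
    sentiment: str

    def __post_init__(self) -> None:
        if self.sentiment not in SENTIMENTS:
            raise ValueError(f"unknown sentiment: {self.sentiment}")
        if not 0 <= self.start <= self.end:
            raise ValueError("span must satisfy 0 <= start <= end")


@dataclass
class MabsaExample:
    """A text-image pair with its aspect-level annotations."""
    tokens: List[str]
    image_path: str  # placeholder path; no image is loaded in this sketch
    labels: List[AspectSentiment]


def aspect_terms(ex: MabsaExample) -> List[str]:
    """Extraction-only view: the aspect strings, without sentiment."""
    return [" ".join(ex.tokens[a.start:a.end + 1]) for a in ex.labels]


def joint_pairs(ex: MabsaExample) -> List[Tuple[str, str]]:
    """Joint view: each aspect string paired with its sentiment."""
    return [(term, a.sentiment) for term, a in zip(aspect_terms(ex), ex.labels)]


# Invented sample, not drawn from any benchmark dataset.
example = MabsaExample(
    tokens="Great show by Taylor Swift at Madison Square Garden".split(),
    image_path="example.jpg",
    labels=[AspectSentiment(3, 4, "POS"), AspectSentiment(6, 8, "NEU")],
)

print(aspect_terms(example))
print(joint_pairs(example))
```

In this sketch the extraction-only view and the joint view are read off the same annotation, which is one plausible way to think about how a single model could serve several subtasks. The paper's actual task formulation may differ.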

Code & Models

Repositories

nustm/vlp-mabsa
PyTorch · Official

Taxonomy

Topics: Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Computational and Text Analysis Methods