PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment
Shezheng Song, Shasha Li, Shan Zhao, Chengyu Wang, Xiaopeng Li, Jie, Yu, Qian Wan, Jun Ma, Tianwei Yan, Wentao Ma, Xiaoguang Mao

TL;DR
This paper introduces a pipeline framework for multimodal aspect-based sentiment analysis that improves image utilization and achieves state-of-the-art results by separating aspect extraction and sentiment classification with translation-based alignment.
Contribution
The paper proposes a novel pipeline approach with translation-based alignment for MABSA, addressing limitations of joint models in aligning text and images.
Findings
Achieves state-of-the-art performance on Twitter-15 and Twitter-17 datasets.
Demonstrates the effectiveness of separating aspect extraction and sentiment classification.
Shows that translation-based alignment enhances multimodal semantic consistency.
Abstract
Multimodal aspect-based sentiment analysis (MABSA) aims to understand opinions in a granular manner, advancing human-computer interaction and other fields. Traditionally, MABSA methods use a joint prediction approach to identify aspects and sentiments simultaneously. However, we argue that joint models are not always superior. Our analysis shows that joint models struggle to align relevant text tokens with image patches, leading to misalignment and ineffective image utilization. In contrast, a pipeline framework first identifies aspects through MATE (Multimodal Aspect Term Extraction) and then aligns these aspects with image patches for sentiment classification (MASC: Multimodal Aspect-Oriented Sentiment Classification). This method is better suited for multimodal scenarios where effective image use is crucial. We present three key observations: (a) MATE and MASC have different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Sentiment Analysis and Opinion Mining · Topic Modeling
MethodsALIGN · MATE
