New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis
Quy Hoang Nguyen, Minh-Van Truong Nguyen, Kiet Van Nguyen

TL;DR
This paper introduces a new Vietnamese multimodal dataset with fine-grained annotations and proposes a novel cross-modal fusion framework that enhances sentiment analysis accuracy by leveraging detailed image and text interactions.
Contribution
The work provides a new benchmark dataset, ViMACSA, with detailed annotations, and develops a fine-grained cross-modal fusion framework that improves sentiment analysis performance.
Findings
Our framework achieves a highest F1 score of 79.73%.
ViMACSA dataset contains 4,876 text-image pairs with 14,618 annotations.
The framework effectively models intra- and inter-modality interactions.
Abstract
The emergence of multimodal data on social media platforms presents new opportunities to better understand user sentiments toward a given aspect. However, existing multimodal datasets for Aspect-Category Sentiment Analysis (ACSA) often focus on textual annotations, neglecting fine-grained information in images. Consequently, these datasets fail to fully exploit the richness inherent in multimodal. To address this, we introduce a new Vietnamese multimodal dataset, named ViMACSA, which consists of 4,876 text-image pairs with 14,618 fine-grained annotations for both text and image in the hotel domain. Additionally, we propose a Fine-Grained Cross-Modal Fusion Framework (FCMF) that effectively learns both intra- and inter-modality interactions and then fuses these information to produce a unified multimodal representation. Experimental results show that our framework outperforms SOTA models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Computational and Text Analysis Methods · Simulation and Modeling Applications
MethodsFocus
