MSCTD: A Multimodal Sentiment Chat Translation Dataset
Yunlong Liang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

TL;DR
This paper introduces a new multimodal chat translation task and dataset, demonstrating how visual and sentiment information can improve translation accuracy in conversational contexts.
Contribution
It creates the MSCTD dataset with multilingual dialogue and sentiment annotations and benchmarks baseline systems incorporating multimodal and sentiment features.
Findings
Multimodal and sentiment features enhance translation quality.
Preliminary experiments show positive impact of contextual information.
MSCTD provides new benchmarks for dialogue sentiment analysis.
Abstract
Multimodal machine translation and textual chat translation have received considerable attention in recent years. Although the conversation in its natural form is usually multimodal, there still lacks work on multimodal machine translation in conversations. In this work, we introduce a new task named Multimodal Chat Translation (MCT), aiming to generate more accurate translations with the help of the associated dialogue history and visual context. To this end, we firstly construct a Multimodal Sentiment Chat Translation Dataset (MSCTD) containing 142,871 English-Chinese utterance pairs in 14,762 bilingual dialogues and 30,370 English-German utterance pairs in 3,079 bilingual dialogues. Each utterance pair, corresponding to the visual context that reflects the current conversational scene, is annotated with a sentiment label. Then, we benchmark the task by establishing multiple baseline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Natural Language Processing Techniques
