MSCTD: A Multimodal Sentiment Chat Translation Dataset

Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou

arXiv:2202.13645·cs.CL·March 1, 2022

MSCTD: A Multimodal Sentiment Chat Translation Dataset

Yunlong Liang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new multimodal chat translation task and dataset, demonstrating how visual and sentiment information can improve translation accuracy in conversational contexts.

Contribution

It creates the MSCTD dataset with multilingual dialogue and sentiment annotations and benchmarks baseline systems incorporating multimodal and sentiment features.

Findings

01

Multimodal and sentiment features enhance translation quality.

02

Preliminary experiments show positive impact of contextual information.

03

MSCTD provides new benchmarks for dialogue sentiment analysis.

Abstract

Multimodal machine translation and textual chat translation have received considerable attention in recent years. Although the conversation in its natural form is usually multimodal, there still lacks work on multimodal machine translation in conversations. In this work, we introduce a new task named Multimodal Chat Translation (MCT), aiming to generate more accurate translations with the help of the associated dialogue history and visual context. To this end, we firstly construct a Multimodal Sentiment Chat Translation Dataset (MSCTD) containing 142,871 English-Chinese utterance pairs in 14,762 bilingual dialogues and 30,370 English-German utterance pairs in 3,079 bilingual dialogues. Each utterance pair, corresponding to the visual context that reflects the current conversational scene, is annotated with a sentiment label. Then, we benchmark the task by establishing multiple baseline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xl2248/msctd
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Natural Language Processing Techniques