3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset
Xinyu Ma, Xuebo Liu, Derek F. Wong, Jun Rao, Bei Li, Liang Ding, Lidia, S. Chao, Dacheng Tao, and Min Zhang

TL;DR
The paper introduces 3AM, a new ambiguity-aware multimodal machine translation dataset with 26,000 English-Chinese sentence-image pairs, designed to enhance the challenge and effectiveness of MMT models by including more ambiguity and variety.
Contribution
It presents a novel dataset that emphasizes ambiguity and diversity, along with benchmarking of state-of-the-art models to demonstrate improved utilization of visual information.
Findings
Models trained on 3AM better exploit visual cues.
3AM dataset contains more ambiguous and diverse data.
Benchmark results show increased challenge for MMT models.
Abstract
Multimodal machine translation (MMT) is a challenging task that seeks to improve translation quality by incorporating visual information. However, recent studies have indicated that the visual information provided by existing MMT datasets is insufficient, causing models to disregard it and overestimate their capabilities. This issue presents a significant obstacle to the development of MMT research. This paper presents a novel solution to this issue by introducing 3AM, an ambiguity-aware MMT dataset comprising 26,000 parallel sentence pairs in English and Chinese, each with corresponding images. Our dataset is specifically designed to include more ambiguity and a greater variety of both captions and images than other MMT datasets. We utilize a word sense disambiguation model to select ambiguous data from vision-and-language datasets, resulting in a more challenging dataset. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
