The MeMAD Submission to the WMT18 Multimodal Translation Task
Stig-Arne Gr\"onroos, Benoit Huet, Mikko Kurimo, Jorma, Laaksonen, Bernard Merialdo, Phu Pham, Mats Sj\"oberg, Umut, Sulubacak, J\"org Tiedemann, Raphael Troncy, Ra\'ul V\'azquez

TL;DR
This paper presents the MeMAD system for the WMT18 multimodal translation task, adapting Transformer models to include visual features, but finds text quality and data more impactful than visual input.
Contribution
It introduces a Transformer-based multimodal translation approach and demonstrates the importance of text data quality over visual features.
Findings
Visual features have limited impact on translation quality.
High-quality text-only NMT systems outperform multimodal variants.
Additional data improves translation performance.
Abstract
This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
