LIUM-CVC Submissions for WMT17 Multimodal Translation Task

Ozan Caglayan; Walid Aransa; Adrien Bardet; Mercedes García-Martínez; Fethi Bougares; Loïc Barrault; Marc Masana; Luis Herranz; Joost van de Weijer

arXiv:1707.04481·cs.CL·July 17, 2017

TL;DR

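This paper presents the monomodal and multimodal neural machine translation systems that LIUM and CVC submitted to the WMT17 Shared Task on Multimodal Translation. The multimodal systems integrate visual features into the translation model, and the final systems ranked first for both English-German and English-French.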

Contribution

Explores two multimodal NMT architectures that draw on visual context: one integrates global visual features, the other convolutional feature maps.

Findings

01 Ranked first in En-De and En-Fr translation tasks

02 Effective integration of visual context enhances translation quality

03 Achieved top scores on METEOR and BLEU metrics

Abstract

This paper describes the monomodal and multimodal Neural Machine Translation systems developed by LIUM and CVC for the WMT17 Shared Task on Multimodal Translation. We mainly explored two multimodal architectures where either global visual features or convolutional feature maps are integrated in order to benefit from visual context. Our final systems ranked first for both En-De and En-Fr language pairs according to the automatic evaluation metrics METEOR and BLEU.
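
The abstract distinguishes two kinds of visual input: a single global feature vector per image, and a convolutional feature map with one vector per spatial location. The abstract does not say how either is wired into the translation model, so the following is only a minimal sketch of two common options, not the authors' method: a global vector projected to an initial decoder state, and soft attention over feature-map locations. All function names, weight matrices, and dimensions (`D`, `H`, `N`, `C`, `A`) are hypothetical and chosen for illustration.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def init_state_from_global_feature(v, W, b):
    """Project a global image feature v (length D) to a decoder state (length H).

    Assumption: the global feature initializes the decoder; other uses exist.
    """
    return [math.tanh(z + bi) for z, bi in zip(matvec(W, v), b)]


def attend_feature_map(fmap, h, W_f, W_h, u):
    """Soft attention over the N spatial locations of a feature map.

    fmap: N rows of C channels; h: decoder state (length H).
    W_f: A x C, W_h: A x H, u: length A (all hypothetical parameters).
    Returns the attention weights (length N) and the visual context (length C).
    """
    h_proj = matvec(W_h, h)
    scores = []
    for loc in fmap:
        f_proj = matvec(W_f, loc)
        scores.append(sum(ui * math.tanh(f + p)
                          for ui, f, p in zip(u, f_proj, h_proj)))
    # Softmax with the maximum subtracted for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Context is the attention-weighted average of the location vectors.
    context = [sum(w * loc[c] for w, loc in zip(weights, fmap))
               for c in range(len(fmap[0]))]
    return weights, context


def rand_matrix(rows, cols):
    """Small random matrix standing in for learned parameters."""
    return [[random.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


random.seed(0)
D, H, N, C, A = 6, 4, 9, 5, 3  # toy sizes, not taken from the paper

v = [random.gauss(0.0, 1.0) for _ in range(D)]   # stand-in global feature
h0 = init_state_from_global_feature(v, rand_matrix(H, D), [0.0] * H)

fmap = rand_matrix(N, C)                          # stand-in feature map
u = [random.gauss(0.0, 0.1) for _ in range(A)]
weights, context = attend_feature_map(
    fmap, h0, rand_matrix(A, C), rand_matrix(A, H), u)

print(len(h0), len(weights), len(context))
```

The practical difference the sketch highlights is that a global vector gives the decoder one fixed summary of the image, while a feature map lets the decoder reweight image regions at each step.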
