Multi30K: Multilingual English-German Image Descriptions

Desmond Elliott; Stella Frank; Khalil Sima'an; Lucia Specia

arXiv:1605.00459·cs.CL·May 3, 2016

TL;DR

The paper introduces Multi30K, an image description dataset with English and German annotations, intended to stimulate multilingual multimodal research.

Contribution

The paper extends the Flickr30K dataset with two kinds of German data: professional translations of a subset of the English descriptions, and German descriptions crowdsourced independently of the English ones.

Findings

01

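The dataset adds German translations of a subset of the Flickr30K English descriptions, plus German descriptions crowdsourced independently of the English ones.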

02

The paper outlines how the data can be used for multilingual image description and multimodal machine translation.

03

The authors anticipate the data will be useful for a broader range of tasks.

Abstract

We introduce the Multi30K dataset to stimulate multilingual multimodal research. Recent advances in image description have been demonstrated on English-language datasets almost exclusively, but image description should not be limited to English. This dataset extends the Flickr30K dataset with i) German translations created by professional translators over a subset of the English descriptions, and ii) descriptions crowdsourced independently of the original English descriptions. We outline how the data can be used for multilingual image description and multimodal machine translation, but we anticipate the data will be useful for a broader range of tasks.
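The two kinds of German data described in the abstract lend themselves to different tasks, and a small sketch may make the distinction concrete. The code below is only an illustration under stated assumptions: the `ImageRecord` layout, its field names, the helper functions, and the example sentences are all hypothetical and are not the released file format or actual dataset content.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class ImageRecord:
    """One image with its descriptions (hypothetical layout, not the released format)."""
    image_id: str
    english: List[str]  # original English descriptions
    # (index into `english`, German translation of that description), if one exists
    german_translation: Optional[Tuple[int, str]] = None
    # German descriptions written without reference to the English ones
    german_independent: List[str] = field(default_factory=list)


def translation_pairs(records: List[ImageRecord]) -> Iterator[Tuple[str, str, str]]:
    """Yield (image_id, English source, German target) for multimodal translation."""
    for r in records:
        if r.german_translation is not None:
            idx, german = r.german_translation
            yield r.image_id, r.english[idx], german


def description_targets(records: List[ImageRecord], lang: str) -> Iterator[Tuple[str, str]]:
    """Yield (image_id, description) pairs for image description in one language."""
    if lang not in ("en", "de"):
        raise ValueError(f"unsupported language: {lang}")
    for r in records:
        texts = r.english if lang == "en" else r.german_independent
        for text in texts:
            yield r.image_id, text


# Invented placeholder sentences, NOT taken from the dataset.
records = [
    ImageRecord(
        image_id="img-0001",
        english=["A dog runs on the beach.", "A brown dog is running along the shore."],
        german_translation=(0, "Ein Hund rennt am Strand."),
        german_independent=["Ein brauner Hund läuft am Meer entlang."],
    )
]

pairs = list(translation_pairs(records))
german_descriptions = list(description_targets(records, "de"))
```

In this sketch, translation pairs are tied to one specific English sentence, while the independent German descriptions are tied only to the image, which mirrors the split between multimodal machine translation and multilingual image description.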
