Cross-linguistic differences and similarities in image descriptions

Emiel van Miltenburg; Desmond Elliott; Piek Vossen

arXiv:1707.01736·cs.CL·August 15, 2017

Cross-linguistic differences and similarities in image descriptions

Emiel van Miltenburg, Desmond Elliott, Piek Vossen

PDF

1 Repo

TL;DR

This paper compares image descriptions across Dutch, English, and German, revealing similarities and the influence of worker familiarity on description detail.

Contribution

It provides the first cross-linguistic analysis of image descriptions, highlighting factors affecting description specificity across languages.

Findings

01

Descriptions are similar across languages in many aspects.

02

Worker familiarity impacts description specificity.

03

Cross-linguistic differences are influenced by cultural and contextual factors.

Abstract

Automatic image description systems are commonly trained and evaluated on large image description datasets. Recently, researchers have started to collect such datasets for languages other than English. An unexplored question is how different these datasets are from English and, if there are any differences, what causes them to differ. This paper provides a cross-linguistic comparison of Dutch, English, and German image descriptions. We find that these descriptions are similar in many respects, but the familiarity of crowd workers with the subjects of the images has a noticeable influence on description specificity.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cltl/DutchDescriptions
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.