Towards Automatic Evaluation for Image Transcreation
Simran Khanuja, Vivek Iyer, Claire He, Graham Neubig

TL;DR
This paper introduces a set of automatic evaluation metrics for image transcreation, inspired by machine translation metrics, to assess cultural relevance, semantic equivalence, and visual similarity, aligning well with human judgments.
Contribution
It proposes a comprehensive suite of automatic metrics for image transcreation evaluation, filling the gap of reliance on human assessment and grounded in translation theory and practice.
Findings
VLMs effectively identify cultural relevance and semantic equivalence.
Vision-encoder representations excel at measuring visual similarity.
Metrics show strong correlation (0.55-0.87) with human ratings across 7 countries.
Abstract
Beyond conventional paradigms of translating speech and text, recently, there has been interest in automated transcreation of images to facilitate localization of visual content across different cultures. Attempts to define this as a formal Machine Learning (ML) problem have been impeded by the lack of automatic evaluation mechanisms, with previous work relying solely on human evaluation. In this paper, we seek to close this gap by proposing a suite of automatic evaluation metrics inspired by machine translation (MT) metrics, categorized into: a) Object-based, b) Embedding-based, and c) VLM-based. Drawing on theories from translation studies and real-world transcreation practices, we identify three critical dimensions of image transcreation: cultural relevance, semantic equivalence and visual similarity, and design our metrics to evaluate systems along these axes. Our results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Advanced Image Processing Techniques
