Preliminary WMT24 Ranking of General MT Systems and LLMs

Tom Kocmi; Eleftherios Avramidis; Rachel Bawden; Ondrej Bojar; Anton Dvorkovich; Christian Federmann; Mark Fishel; Markus Freitag; Thamme Gowda; Roman Grundkiewicz; Barry Haddow; Marzena Karpinska; Philipp Koehn; Benjamin Marie; Kenton Murray; Masaaki Nagata; Martin Popel; Maja Popovic; Mariya Shmatova; Steinþór Steingrímsson; Vilém Zouhar

arXiv:2407.19884 · cs.CL · July 30, 2024 · 2 citations

TL;DR

This paper presents a preliminary automatic ranking of WMT24 general machine translation systems and large language models, serving as an early benchmark before the final human evaluation.

Contribution

It provides an initial automatic ranking of MT systems for WMT24, aiding participants before the official human evaluation results are available.

Findings

01 Preliminary automatic rankings are established.
02 Human evaluation will supersede automatic metrics.
03 Results aim to assist system development.

Abstract

This is the preliminary ranking of WMT24 General MT systems based on automatic metrics. The official ranking will be a human evaluation, which is superior to the automatic ranking and supersedes it. The purpose of this report is not to interpret any findings but only to provide the participants of the General MT task with preliminary results that may be useful when writing their system submissions.

Code & Models

Repositories

wmt-conference/wmt-collect-translations (official)

Taxonomy

Topics: Advanced Research in Systems and Signal Processing