Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers

Hannah Calzi Kleidermacher; James Zou

arXiv:2502.17882·cs.AI·September 5, 2025

Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers

Hannah Calzi Kleidermacher, James Zou

PDF

TL;DR

This paper presents an automated LLM-based method for translating scientific articles into multiple languages while preserving formatting, evaluating accuracy through a novel QA benchmark and user studies, demonstrating high fidelity and adaptability.

Contribution

It introduces a practical, automated approach for multilingual scientific translation using LLMs, including a novel QA benchmark and techniques to align translations with domain preferences.

Findings

01

Average translation accuracy of 95.9% on scientific content

02

Authors found translations accurately captured original information

03

In-context learning can mitigate overtranslation issues

Abstract

Scientific research is inherently global. However, the vast majority of academic journals are published exclusively in English, creating barriers for non-native-English-speaking researchers. In this study, we leverage large language models (LLMs) to translate published scientific articles while preserving their native JATS XML formatting, thereby developing a practical, automated approach for implementation by academic journals. Using our approach, we translate articles across multiple scientific disciplines into 28 languages. To evaluate translation accuracy, we introduce a novel question-and-answer (QA) benchmarking method, in which an LLM generates comprehension-based questions from the original text and then answers them based on the translated text. Our benchmark results show an average performance of 95.9%, showing that the key scientific details are accurately conveyed. In a user…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN