Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications?
Yotam Intrator, Matan Halfon, Roman Goldenberg, Reut Tsarfaty, Matan, Eyal, Ehud Rivlin, Yossi Matias, Natalia Aizenberg

TL;DR
This paper demonstrates that direct inference without pre-translation often outperforms pre-translation in multilingual large language models, specifically PaLM2, across numerous languages and benchmarks, improving efficiency and linguistic authenticity.
Contribution
It provides a comprehensive evaluation showing that direct inference can surpass pre-translation in multilingual LLMs, challenging established practices and promoting more authentic language processing.
Findings
PaLM2-L outperforms pre-translation in 94 of 108 languages.
Direct inference reduces complexity and information loss.
Study covers 108 languages and 6 diverse benchmarks.
Abstract
Large language models hold significant promise in multilingual applications. However, inherent biases stemming from predominantly English-centric pre-training have led to the widespread practice of pre-translation, i.e., translating non-English inputs to English before inference, leading to complexity and information loss. This study re-evaluates the need for pre-translation in the context of PaLM2 models (Anil et al., 2023), which have been established as highly performant in multilingual tasks. We offer a comprehensive investigation across 108 languages and 6 diverse benchmarks, including open-end generative tasks, which were excluded from previous similar studies. Our findings challenge the pre-translation paradigm established in prior research, highlighting the advantages of direct inference in PaLM2. Specifically, PaLM2-L consistently outperforms pre-translation in 94 out of 108…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques
