Localizing AI: Evaluating Open-Weight Language Models for Languages of   Baltic States

Jurgita Kapo\v{c}i\=ut\.e-Dzikien\.e; Toms Bergmanis; M\=arcis Pinnis

arXiv:2501.03952·cs.CL·January 8, 2025

Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States

Jurgita Kapo\v{c}i\=ut\.e-Dzikien\.e, Toms Bergmanis, M\=arcis Pinnis

PDF

Open Access

TL;DR

This study evaluates open-weight multilingual language models for Baltic languages, revealing their strengths in translation but also their limitations like lexical hallucinations, which impact their suitability for sensitive applications.

Contribution

It provides a comprehensive evaluation of open-weight LLMs on lesser-spoken Baltic languages, highlighting their capabilities and limitations in real-world tasks.

Findings

01

Gemma 2 performs close to top commercial models

02

Models exhibit lexical hallucinations in 1 in 20 words

03

Open-weight models show promise but have significant limitations

Abstract

Although large language models (LLMs) have transformed our expectations of modern language technologies, concerns over data privacy often restrict the use of commercially available LLMs hosted outside of EU jurisdictions. This limits their application in governmental, defence, and other data-sensitive sectors. In this work, we evaluate the extent to which locally deployable open-weight LLMs support lesser-spoken languages such as Lithuanian, Latvian, and Estonian. We examine various size and precision variants of the top-performing multilingual open-weight models, Llama~3, Gemma~2, Phi, and NeMo, on machine translation, multiple-choice question answering, and free-form text generation. The results indicate that while certain models like Gemma~2 perform close to the top commercially available models, many LLMs struggle with these languages. Most surprisingly, however, we find that these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques