Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

Spyridon Mavromatis; Sokratis Sofianopoulos; Prokopis Prokopidis; Maria Giagkou

arXiv:2605.18504·cs.CL·May 19, 2026

Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models

Spyridon Mavromatis, Sokratis Sofianopoulos, Prokopis Prokopidis, Maria Giagkou

PDF

TL;DR

This paper introduces a new Ancient Greek to Modern Greek parallel corpus, a novel alignment pipeline, and benchmarks various models, showing significant improvements with fine-tuning on this low-resource translation task.

Contribution

It provides the first comprehensive benchmark for AG-MG translation, a new high-quality dataset, and a novel multi-stage alignment and correction pipeline for low-resource language pairs.

Findings

01

Fine-tuning models improves BLEU scores by up to +10.3 points.

02

Full-parameter fine-tuning of Llama-Krikri-8B achieves BLEU 13.16.

03

The M2M100 model with QLoRA adaptation shows large relative gains.

Abstract

Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this gap by introducing the AG-MG Parallel Corpus, a new resource containing 132,481 sentence-aligned pairs derived from literary, historical, and biblical texts. We present a novel corpus creation pipeline that combines web-scraped, excerpt-level data with a multi-stage sentence-level alignment, and refinement process. Our method uses VecAlign with LaBSE embeddings, which we first fine-tune on a manually-aligned AG-MG subset, followed by an LLM-based error/misalignment correction phase using Gemini 2.5 Flash to ensure high alignment quality. Furthermore, we provide the first comprehensive benchmark of modern MT models on this task, evaluating three fine-tuning strategies across NMT models (NLLB, M2M100) and a Greek…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.