Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models
Spyridon Mavromatis, Sokratis Sofianopoulos, Prokopis Prokopidis, Maria Giagkou

TL;DR
This paper introduces a new Ancient Greek to Modern Greek parallel corpus, a novel alignment pipeline, and benchmarks various models, showing significant improvements with fine-tuning on this low-resource translation task.
Contribution
It provides the first comprehensive benchmark for AG-MG translation, a new high-quality dataset, and a novel multi-stage alignment and correction pipeline for low-resource language pairs.
Findings
Fine-tuning models improves BLEU scores by up to +10.3 points.
Full-parameter fine-tuning of Llama-Krikri-8B achieves BLEU 13.16.
The M2M100 model with QLoRA adaptation shows large relative gains.
Abstract
Machine Translation (MT) for Ancient Greek (AG) to Modern Greek (MG) is a low-resource task, constrained by the lack of large-scale, high-quality parallel data. We address this gap by introducing the AG-MG Parallel Corpus, a new resource containing 132,481 sentence-aligned pairs derived from literary, historical, and biblical texts. We present a novel corpus creation pipeline that combines web-scraped, excerpt-level data with a multi-stage sentence-level alignment, and refinement process. Our method uses VecAlign with LaBSE embeddings, which we first fine-tune on a manually-aligned AG-MG subset, followed by an LLM-based error/misalignment correction phase using Gemini 2.5 Flash to ensure high alignment quality. Furthermore, we provide the first comprehensive benchmark of modern MT models on this task, evaluating three fine-tuning strategies across NMT models (NLLB, M2M100) and a Greek…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
