Instruct-Tuning Pretrained Causal Language Models for Ancient Greek   Papyrology and Epigraphy

Eric Cullhed

arXiv:2409.13870·cs.CL·November 19, 2024

Instruct-Tuning Pretrained Causal Language Models for Ancient Greek Papyrology and Epigraphy

Eric Cullhed

PDF

Open Access 10 Models

TL;DR

This paper demonstrates that fine-tuning large pretrained causal language models with instruction templates significantly improves the restoration and attribution of ancient Greek inscriptions and papyri, showing promising results for digital humanities applications.

Contribution

It introduces a straightforward instruction-based fine-tuning approach for large language models to assist in ancient Greek text restoration and attribution tasks, outperforming previous models in text reconstruction.

Findings

01

Achieved a character error rate of 14.9% in text restoration.

02

Outperformed the state-of-the-art model Ithaca in text restoration tasks.

03

Demonstrated effective geographic and chronological attribution with reasonable accuracy.

Abstract

This article presents an experiment in fine-tuning a pretrained causal language model (Meta's Llama 3.1 8B Instruct) to assist with restoring missing or illegible characters in ancient Greek inscriptions and documentary papyri. Utilizing a straightforward instruction-based approach and a 95%/5% train/test split, the papyrus restoration model achieved a character error rate (CER) of 14.9%, a top-1 accuracy of 73.5%, and a top-20 accuracy of 86.0% for sequences up to 10 characters. A model was also fine-tuned for geographic attribution, reaching a top-1 accuracy of 66.4% and a top-3 accuracy of 79.9%. In chronological attribution, it demonstrated an average deviation of 21.7 years from the actual terminus post/ante quem, with a median deviation of 0 years. For inscriptions, the restoration model achieved a CER of 20.5%, a top-1 accuracy of 63.7%, and a top-20 accuracy of 83.0% for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Digital Humanities and Scholarship

MethodsALIGN · Sparse Evolutionary Training · LLaMA