Instruct-Tuning Pretrained Causal Language Models for Ancient Greek Papyrology and Epigraphy
Eric Cullhed

TL;DR
This paper demonstrates that fine-tuning large pretrained causal language models with instruction templates significantly improves the restoration and attribution of ancient Greek inscriptions and papyri, showing promising results for digital humanities applications.
Contribution
It introduces a straightforward instruction-based fine-tuning approach for large language models to assist in ancient Greek text restoration and attribution tasks, outperforming previous models in text reconstruction.
Findings
Achieved a character error rate of 14.9% in text restoration.
Outperformed the state-of-the-art model Ithaca in text restoration tasks.
Demonstrated effective geographic and chronological attribution with reasonable accuracy.
Abstract
This article presents an experiment in fine-tuning a pretrained causal language model (Meta's Llama 3.1 8B Instruct) to assist with restoring missing or illegible characters in ancient Greek inscriptions and documentary papyri. Utilizing a straightforward instruction-based approach and a 95%/5% train/test split, the papyrus restoration model achieved a character error rate (CER) of 14.9%, a top-1 accuracy of 73.5%, and a top-20 accuracy of 86.0% for sequences up to 10 characters. A model was also fine-tuned for geographic attribution, reaching a top-1 accuracy of 66.4% and a top-3 accuracy of 79.9%. In chronological attribution, it demonstrated an average deviation of 21.7 years from the actual terminus post/ante quem, with a median deviation of 0 years. For inscriptions, the restoration model achieved a CER of 20.5%, a top-1 accuracy of 63.7%, and a top-20 accuracy of 83.0% for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Ericu950/Epigr_1_Llama-3.1-8B-Instruct_placemodel· 5 dl· ♡ 15 dl♡ 1
- 🤗Ericu950/Papy_1_Llama-3.1-8B-Instruct_placemodel· 4 dl4 dl
- 🤗Ericu950/Papy_1_Llama-3.1-8B-Instruct_datemodel· 4 dl4 dl
- 🤗RichardErkhov/Ericu950_-_Papy_1_Llama-3.1-8B-Instruct_date-ggufmodel· 562 dl562 dl
- 🤗Ericu950/Epigr_2_Llama-3.1-8B-Instruct_datemodel· 3 dl3 dl
- 🤗Ericu950/Epigr_2_Llama-3.1-8B-Instruct_textmodel· 4 dl4 dl
- 🤗Ericu950/Papy_2_Llama-3.1-8B-Instruct_textmodel· 15 dl· ♡ 215 dl♡ 2
- 🤗RichardErkhov/Ericu950_-_Epigr_1_Llama-3.1-8B-Instruct_place-4bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/Ericu950_-_Epigr_1_Llama-3.1-8B-Instruct_place-8bitsmodel· 1 dl1 dl
- 🤗RichardErkhov/Ericu950_-_Papy_2_Llama-3.1-8B-Instruct_text-8bitsmodel· 1 dl1 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Digital Humanities and Scholarship
MethodsALIGN · Sparse Evolutionary Training · LLaMA
