TL;DR
This study evaluates large language models for POS tagging in medieval Romance languages, showing they outperform traditional methods, especially with transfer learning, and provides practical guidance for digital humanities applications.
Contribution
It offers a systematic empirical comparison of LLMs and traditional taggers for medieval languages, highlighting transfer learning strategies and releasing resources for reproducibility.
Findings
LLMs outperform traditional POS taggers across medieval languages.
Fine-tuning and multilingual training significantly improve LLM performance.
Cross-lingual transfer learning benefits under-resourced medieval language varieties.
Abstract
Part-of-speech (POS) tagging for Medieval Romance languages remains challenging due to orthographic variation, morphological complexity, and limited annotated resources. This paper presents a systematic empirical evaluation of large language models (LLMs) for POS tagging across three medieval varieties: Medieval Occitan, Medieval Catalan, and Medieval French. We compare traditional rule-based and statistical taggers with modern open-source LLMs under zero-shot prompting, few-shot prompting, monolingual fine-tuning, and cross-lingual transfer learning settings. Experiments on historically grounded datasets show that LLM-based approaches consistently outperform traditional taggers, with fine-tuning and multilingual training yielding the largest improvements. In particular, cross-lingual transfer learning substantially benefits under-resourced varieties, while targeted bilingual training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
