Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical Documents
Mark Humphries, Lianne C. Leddy, Quinn Downton, Meredith Legace, John, McConnell, Isabella Murray, and Elizabeth Spence

TL;DR
This paper shows that large language models can transcribe and correct historical handwritten documents more accurately, faster, and cheaper than specialized software, enabling efficient digitization of archives.
Contribution
Introduces Transcription Pearl, an open-source tool leveraging LLMs for high-accuracy, cost-effective transcription and correction of handwritten historical documents.
Findings
LLMs achieved CER of 5.7-7% and WER of 8.9-15.9% on historical texts.
LLMs improved transcription accuracy over state-of-the-art HTR software by 14-32%.
LLMs achieved near-human accuracy with CER as low as 1.8% and WER of 3.5%.
Abstract
This study demonstrates that Large Language Models (LLMs) can transcribe historical handwritten documents with significantly higher accuracy than specialized Handwritten Text Recognition (HTR) software, while being faster and more cost-effective. We introduce an open-source software tool called Transcription Pearl that leverages these capabilities to automatically transcribe and correct batches of handwritten documents using commercially available multimodal LLMs from OpenAI, Anthropic, and Google. In tests on a diverse corpus of 18th/19th century English language handwritten documents, LLMs achieved Character Error Rates (CER) of 5.7 to 7% and Word Error Rates (WER) of 8.9 to 15.9%, improvements of 14% and 32% respectively over specialized state-of-the-art HTR software like Transkribus. Most significantly, when LLMs were then used to correct those transcriptions as well as texts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Digital Humanities and Scholarship
