Two Demonstrations of the Machine Translation Applications to Historical Documents
Miguel Domingo, Francisco Casacuberta

TL;DR
This paper demonstrates two machine translation applications for historical documents: modernizing language and orthography, using an interactive, adaptive system with online learning, showcased through a publicly accessible web platform.
Contribution
It introduces an interactive, adaptive machine translation system for historical documents that incorporates user corrections and online learning, with open-source implementation.
Findings
Effective orthography normalization for historical texts
User-in-the-loop adaptive translation improves accuracy
Open-source web demo available for public use
Abstract
We present our demonstration of two machine translation applications to historical documents. The first task consists in generating a new version of a historical document, written in the modern version of its original language. The second application is limited to a document's orthography. It adapts the document's spelling to modern standards in order to achieve an orthography consistency and accounting for the lack of spelling conventions. We followed an interactive, adaptive framework that allows the user to introduce corrections to the system's hypothesis. The system reacts to these corrections by generating a new hypothesis that takes them into account. Once the user is satisfied with the system's hypothesis and validates it, the system adapts its model following an online learning strategy. This system is implemented following a client-server architecture. We developed a website…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Biomedical Text Mining and Ontologies
