An Interactive Machine Translation Framework for Modernizing Historical Documents
Miguel Domingo, Francisco Casacuberta

TL;DR
This paper presents an interactive framework combining human expertise and machine translation to modernize historical documents, significantly reducing human effort while improving accuracy.
Contribution
It introduces a collaborative approach for modernizing historical texts, integrating scholar input with machine translation to enhance efficiency and accuracy.
Findings
Significant reduction in human effort required.
Effective collaboration between scholars and machine.
Improved accuracy in modernized documents.
Abstract
Due to the nature of human language, historical documents are hard to comprehend by contemporary people. This limits their accessibility to scholars specialized in the time period in which the documents were written. Modernization aims at breaking this language barrier by generating a new version of a historical document, written in the modern version of the document's original language. However, while it is able to increase the document's comprehension, modernization is still far from producing an error-free version. In this work, we propose a collaborative framework in which a scholar can work together with the machine to generate the new version. We tested our approach on a simulated environment, achieving significant reductions of the human effort needed to produce the modernized version of the document.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques
