Automatic Translation Alignment Pipeline for Multilingual Digital Editions of Literary Works
Maria Levchenko

TL;DR
This paper develops an automated pipeline for creating multilingual digital editions of literary works, addressing challenges in aligning and visualizing original and translated texts across multiple languages.
Contribution
It introduces a novel pipeline and metrics tailored for aligning and visualizing literary translations in digital editions, improving upon existing algorithms.
Findings
Identified limitations of current alignment algorithms for literary texts.
Proposed new metrics for evaluating translation alignment quality.
Created a web-based tool for side-by-side literary translation comparison.
Abstract
This paper investigates the application of translation alignment algorithms in the creation of a Multilingual Digital Edition (MDE) of Alessandro Manzoni's Italian novel "I promessi sposi" ("The Betrothed"), with translations in eight languages (English, Spanish, French, German, Dutch, Polish, Russian and Chinese) from the 19th and 20th centuries. We identify key requirements for the MDE to improve both the reader experience and support for translation studies. Our research highlights the limitations of current state-of-the-art algorithms when applied to the translation of literary texts and outlines an automated pipeline for MDE creation. This pipeline transforms raw texts into web-based, side-by-side representations of original and translated texts with different rendering options. In addition, we propose new metrics for evaluating the alignment of literary translations and suggest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Humanities and Scholarship
