Creating an Aligned Corpus of Sound and Text: The Multimodal Corpus of Shakespeare and Milton
Manex Agirrezabal

TL;DR
This paper introduces a multimodal corpus of Shakespeare and Milton poems, aligned with audio and phonetic details, enabling advanced linguistic and literary analysis.
Contribution
It presents a novel aligned corpus of classical poetry with detailed audio, phonetic, and scansion annotations, along with a visualization platform.
Findings
Aligned lines with audio at multiple levels
Includes phonetic and scansion annotations
Provides a visualization platform
Abstract
In this work we present a corpus of poems by William Shakespeare and John Milton that have been enriched with readings from the public domain. We have aligned all the lines with their respective audio segments, at the line, word, syllable and phone level, and we have included their scansion. We make a basic visualization platform for these poems and we conclude by conjecturing possible future directions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTranslation Studies and Practices · Language, Metaphor, and Cognition · Digital Humanities and Scholarship
