TL;DR
This paper introduces a probabilistic and constraint-based syllabification algorithm for Dante's Divine Comedy, enhancing digital humanities research and text processing with detailed phonetic and metric analysis.
Contribution
It presents a novel nondeterministic syllabification method incorporating synalephe propensity and metric constraints, enabling more accurate and flexible analysis of poetic structure.
Findings
Algorithm produces multiple syllabifications with likelihoods.
Incorporates synalephe propensity and metric constraints.
Facilitates digital humanities and deep learning applications.
Abstract
We provide a syllabification algorithm for the Divine Comedy using techniques from probabilistic and constraint programming. We particularly focus on the synalephe, addressed in terms of the "propensity" of a word to take part in a synalephe with adjacent words. We jointly provide an online vocabulary containing, for each word, information about its syllabification, the location of the tonic accent, and the aforementioned synalephe propensity, on the left and right sides. The algorithm is intrinsically nondeterministic, producing different possible syllabifications for each verse, with different likelihoods; metric constraints relative to accents on the 10th, 4th and 6th syllables are used to further reduce the solution space. The most likely syllabification is hence returned as output. We believe that this work could be a major milestone for a lot of different investigations. From the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
