Latin writing styles analysis with Machine Learning: New approach to old questions
Arianna Di Bernardo, Simone Poetto, Pietro Sillano, Beatrice Villata,, Weronika S\'ojka, Zofia Pi\k{e}tka-Danilewicz, Piotr Pranke

TL;DR
This paper presents a machine learning-based approach to analyze Latin literary styles, aiming to identify authorship, time periods, and source texts by transforming texts into numerical data and applying pattern recognition techniques.
Contribution
It introduces a novel tool that uses NLP and machine learning to analyze Latin texts for authorship, dating, and source identification, enhancing traditional literary analysis methods.
Findings
Successfully identified authorship with high accuracy
Detected stylistic similarities across centuries
Enabled source attribution for anonymous texts
Abstract
In the Middle Ages texts were learned by heart and spread using oral means of communication from generation to generation. Adaptation of the art of prose and poems allowed keeping particular descriptions and compositions characteristic for many literary genres. Taking into account such a specific construction of literature composed in Latin, we can search for and indicate the probability patterns of familiar sources of specific narrative texts. Consideration of Natural Language Processing tools allowed us the transformation of textual objects into numerical ones and then application of machine learning algorithms to extract information from the dataset. We carried out the task consisting of the practical use of those concepts and observation to create a tool for analyzing narrative texts basing on open-source databases. The tool focused on creating specific search tools resources which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Biomedical Text Mining and Ontologies
