The \textit{Questio de aqua et terra}: A Computational Authorship Verification Study
Martina Leocata, Alejandro Moreo, Fabrizio Sebastiani

TL;DR
This study uses advanced computational authorship verification techniques, including a novel oversampling method, to assess the authenticity of a controversial medieval text attributed to Dante, achieving high accuracy and contributing to cultural heritage analysis.
Contribution
Introduces Distributional Random Oversampling (DRO) for authorship verification and applies it to a historical text, demonstrating its effectiveness in cultural heritage studies.
Findings
Best AV system achieved F1=0.970 accuracy
DRO significantly improved authorship verification results
System confidently supports the text's authenticity
Abstract
The Questio de aqua et terra is a cosmological treatise traditionally attributed to Dante Alighieri. However, the authenticity of this text is controversial, due to discrepancies with Dante's established works and to the absence of contemporary references. This study investigates the authenticity of the Questio via computational authorship verification (AV), a class of techniques which combine supervised machine learning and stylometry. We build a family of AV systems and assemble a corpus of 330 13th- and 14th-century Latin texts, which we use to comparatively evaluate the AV systems through leave-one-out cross-validation. Our best-performing system achieves high verification accuracy (F1=0.970) despite the heterogeneity of the corpus in terms of textual genre. The key contribution to the accuracy of this system is shown to come from Distributional Random Oversampling (DRO), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling
