Making Chant Computing Easy: CantusCorpus v1.0 and the PyCantus Library
Anna Dvořáková, Tim Eipert, Debra Lacoste, Jan Hajič jr.

TL;DR
This paper introduces CantusCorpus v1.0, a comprehensive dataset of Gregorian chant sources, and the PyCantus library, which together make computational research in digital chant scholarship easier and more transparent.
Contribution
It provides a unified dataset and a flexible library that decouples data from code, enabling broader access and integration of chant data for digital humanities research.
Findings
Created CantusCorpus v1.0 dataset from existing chant databases
Developed PyCantus library for data manipulation and integration
Enhanced accessibility and reproducibility in digital chant research
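
To illustrate what decoupling data from code can look like in practice, here is a minimal sketch in plain Python. It does not use the PyCantus API, which this summary does not describe. The column names (`cantus_id`, `source_id`, `database`) and the sample rows are hypothetical placeholders, not the actual CantusCorpus v1.0 schema or real catalogue entries. The loader accepts any CSV file-like object, so the same analysis code could run unchanged on a newer export of the data.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows: column names and values are placeholders,
# not the actual CantusCorpus v1.0 schema or real catalogue entries.
SAMPLE_CSV = """cantus_id,source_id,database
id_A,source_1,db_X
id_A,source_2,db_Y
id_B,source_1,db_X
"""


def load_chants(handle):
    """Read chant records from any CSV file-like object into a list of dicts."""
    return list(csv.DictReader(handle))


def sources_by_cantus_id(chants):
    """Collect the set of source identifiers recorded for each Cantus ID."""
    grouped = defaultdict(set)
    for chant in chants:
        grouped[chant["cantus_id"]].add(chant["source_id"])
    return dict(grouped)


# The data lives outside the functions; swapping in an open file handle
# instead of the in-memory sample requires no change to the code above.
chants = load_chants(io.StringIO(SAMPLE_CSV))
index = sources_by_cantus_id(chants)
```

Grouping by a shared identifier is the kind of cross-database query that a common ID mechanism makes possible: records from different databases can be compared as soon as they carry the same ID.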
Abstract
Digital Gregorian chant scholarship has for decades enjoyed the privilege of a large digital resource cataloguing chant sources: the Cantus ecosystem, with nearly 900,000 chants catalogued across more than 2000 sources. The Cantus Database data model and the Cantus ID mechanism have been adopted by 18 more chant databases, jointly accessible through the Cantus Index interface. However, this data has only been available piecemeal via the individual online user interfaces; computational methods have so far had only a limited opportunity to process these immense resources. To overcome this hurdle, we compiled CantusCorpus v1.0, a dataset that combines everything that was available across the Cantus Index-centered network of databases as of mid-2025, and we have also provided the code for updating the dataset as the databases grow. We then created the lightweight PyCantus library for working…
Taxonomy
Topics: Music and Audio Processing · Digital Humanities and Scholarship · Diverse Musicological Studies
