Towards the first UD Treebank of Spoken Italian: the KIParla forest
Ludovica Pannitto

TL;DR
This paper presents the development of the first Universal Dependencies (UD) treebank for spoken Italian, built on the KIParla corpus, to expand the linguistic resources available for Italian language processing.
Contribution
It presents the creation of the first UD treebank for spoken Italian using the KIParla corpus, filling a gap in linguistic resources.
Findings
First UD treebank for spoken Italian
Enriches Italian linguistic resources
Supports NLP applications for spoken language
Abstract
The present project endeavors to enrich the linguistic resources available for Italian by constructing a Universal Dependencies treebank for the KIParla corpus (Mauri et al., 2019; Ballarè et al., 2020), an existing and well-known resource for spoken Italian.
Taxonomy
Topics: Natural Language Processing Techniques · Linguistic Studies and Language Acquisition · Text Readability and Simplification
