Speech Analysis of Language Varieties in Italy
Moreno La Quatra, Alkis Koudounas, Elena Baralis, Sabato Marco Siniscalchi

TL;DR
This paper explores using self-supervised and contrastive learning techniques to analyze Italy's diverse regional speech varieties, achieving effective geographic classification and revealing linguistic relationships.
Contribution
It introduces a novel approach combining self-supervised and contrastive learning to identify Italian regional speech varieties and analyze their linguistic relationships.
Findings
Self-supervised models effectively classify regional speech.
Contrastive learning improves discriminative embeddings.
Enhanced understanding of linguistic relationships among Italian varieties.
Abstract
Italy exhibits rich linguistic diversity across its territory due to the distinct regional languages spoken in different areas. Recent advances in self-supervised learning provide new opportunities to analyze Italy's linguistic varieties using speech data alone. This includes the potential to leverage representations learned from large amounts of data to better examine nuances between closely related linguistic varieties. In this study, we focus on automatically identifying the geographic region of origin of speech samples drawn from Italy's diverse language varieties. We leverage self-supervised learning models to tackle this task and analyze differences and similarities between Italy's regional languages. In doing so, we also seek to uncover new insights into the relationships among these diverse yet closely related varieties, which may help linguists understand their interconnected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistic Studies and Language Acquisition · Language and Culture
MethodsFocus · Contrastive Learning
