Dravidian language family through Universal Dependencies lens
Taraka Rama, Sowmya Vajjala

TL;DR
This paper explores how to adapt the Universal Dependencies framework to accurately represent the unique morphological and syntactic features of Dravidian languages, which are underrepresented in the current UD support.
Contribution
It provides an analysis of Dravidian languages' features and proposes methods for their annotation within the UD framework, addressing a gap in multilingual NLP resources.
Findings
Identification of key morphological features of Dravidian languages
Proposed annotation strategies for syntactic structures
Enhanced compatibility of Dravidian languages with UD standards
Abstract
The Universal Dependencies (UD) project aims to create a cross-linguistically consistent dependency annotation for multiple languages, to facilitate multilingual NLP. It currently supports 114 languages. Dravidian languages are spoken by over 200 million people across the word, and yet there are only two languages from this family in UD. This paper examines some of the morphological and syntactic features of Dravidian languages and explores how they can be annotated in the UD framework.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Language and cultural evolution · Speech Recognition and Synthesis
