Building Odia Shallow Parser
Pruthwik Mishra, Dipti Misra Sharma

TL;DR
This paper presents the creation of annotated corpora and baseline systems for shallow parsing in Odia, addressing resource scarcity in Indian languages for NLP applications.
Contribution
It introduces POS and chunk annotated corpora for Odia and develops initial baseline systems for POS tagging and chunking tasks.
Findings
Created quality annotated corpora for Odia
Developed baseline POS tagging system
Developed baseline chunking system
Abstract
Shallow parsing is an essential task for many NLP applications like machine translation, summarization, sentiment analysis, aspect identification and many more. Quality annotated corpora is critical for building accurate shallow parsers. Many Indian languages are resource poor with respect to the availability of corpora in general. So, this paper is an attempt towards creating quality corpora for shallow parsers. The contribution of this paper is two folds: creation pos and chunk annotated corpora for Odia and development of baseline systems for pos tagging and chunking in Odia.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
