OntoSenseNet: A Verb-Centric Ontological Resource for Indian Languages
Jyoti Jha, Sreekavitha Parupalli, Navjyoti Singh

TL;DR
This paper introduces OntoSenseNet, a lexical resource for Hindi and Telugu that captures intrinsic and extrinsic word meanings based on Indian linguistic traditions and modern ontological frameworks.
Contribution
It develops a gold-standard, manually annotated lexical resource for Indian languages, integrating sense-types and sense-classes for semantic understanding.
Findings
Distribution of verb sense-classes varies across corpora
Word embeddings aid resource enrichment
Resource supports semantic analysis in Indian languages
Abstract
Following approaches for understanding lexical meaning developed by Yaska, Patanjali and Bhartrihari from Indian linguistic traditions and extending approaches developed by Leibniz and Brentano in the modern times, a framework of formal ontology of language was developed. This framework proposes that meaning of words are in-formed by intrinsic and extrinsic ontological structures. The paper aims to capture such intrinsic and extrinsic meanings of words for two major Indian languages, namely, Hindi and Telugu. Parts-of-speech have been rendered into sense-types and sense-classes. Using them we have developed a gold- standard annotated lexical resource to support semantic understanding of a language. The resource has collection of Hindi and Telugu lexicons, which has been manually annotated by native speakers of the languages following our annotation guidelines. Further, the resource was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
