A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit
Jivnesh Sandhan, Ashish Gupta, Hrishikesh Terdalkar, Tushar Sandhan,, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal

TL;DR
This paper introduces a multi-task learning model for Sanskrit compound type identification that leverages contextual, syntactic, and morphological information, significantly outperforming previous methods and demonstrating multilingual applicability.
Contribution
The work presents a novel multi-task learning architecture that incorporates context, morphology, and syntax for improved Sanskrit compound type identification.
Findings
Achieved 6.1 points higher accuracy over state-of-the-art.
Achieved 7.7 points higher F1-score.
Demonstrated effectiveness in English and Marathi languages.
Abstract
The phenomenon of compounding is ubiquitous in Sanskrit. It serves for achieving brevity in expressing thoughts, while simultaneously enriching the lexical and structural formation of the language. In this work, we focus on the Sanskrit Compound Type Identification (SaCTI) task, where we consider the problem of identifying semantic relations between the components of a compound word. Earlier approaches solely rely on the lexical information obtained from the components and ignore the most crucial contextual and syntactic information useful for SaCTI. However, the SaCTI task is challenging primarily due to the implicitly encoded context-sensitive semantic relation between the compound components. Thus, we propose a novel multi-task learning architecture which incorporates the contextual information and enriches the complementary syntactic information using morphological tagging and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies
