A new TAG Formalism for Tamil and Parser Analytics
Vijay Krishna Menon, S. Rajendran, M. Anand Kumar, K.P. Soman

TL;DR
This paper introduces a simplified TAG formalism tailored for Tamil, addressing previous complexities by focusing less on morphology and providing a parser implementation with notable variations from existing systems.
Contribution
The paper presents a minimalistic TAG formalism for Tamil and a corresponding parser, simplifying previous approaches and adapting the XTAG system for Tamil language processing.
Findings
Successfully designed a simplified TAG for Tamil
Developed a parser with variations from XTAG
Demonstrated effectiveness on Tamil language data
Abstract
Tree adjoining grammar (TAG) is specifically suited for morph rich and agglutinated languages like Tamil due to its psycho linguistic features and parse time dependency and morph resolution. Though TAG and LTAG formalisms have been known for about 3 decades, efforts on designing TAG Syntax for Tamil have not been entirely successful due to the complexity of its specification and the rich morphology of Tamil language. In this paper we present a minimalistic TAG for Tamil without much morphological considerations and also introduce a parser implementation with some obvious variations from the XTAG system
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
