Dependency resolution and semantic mining using Tree Adjoining Grammars for Tamil Language
Vijay Krishna Menon, S Rajendran, M Anandkumar, K P Soman

TL;DR
This paper explores using Tree Adjoining Grammars (TAGs) to model Tamil syntax and semantics, demonstrating dependency resolution and semantic mining through parsing Tamil sentences with a custom TAG parser.
Contribution
It presents the first application of TAGs to Tamil language syntax and semantics, including dependency resolution and semantic mining without relying on statistical methods.
Findings
Successfully modeled Tamil syntax with TAGs
Demonstrated semantic mining via dependency resolution
Developed a custom TAG parser for Tamil sentences
Abstract
Tree adjoining grammars (TAGs) provide an ample tool to capture syntax of many Indian languages. Tamil represents a special challenge to computational formalisms as it has extensive agglutinative morphology and a comparatively difficult argument structure. Modelling Tamil syntax and morphology using TAG is an interesting problem which has not been in focus even though TAGs are over 4 decades old, since its inception. Our research with Tamil TAGs have shown us that we can not only represent syntax of the language, but to an extent mine out semantics through dependency resolution of the sentence. But in order to demonstrate this phenomenal property, we need to parse Tamil language sentences using TAGs we have built and through parsing obtain a derivation we could use to resolve dependencies, thus proving the semantic property. We use an in-house developed pseudo lexical TAG chart parser;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · semigroups and automata theory · Algorithms and Data Compression
