Towards the Study of Morphological Processing of the Tangkhul Language

Mirinso Shadang; Navanath Saharia; Thoudam Doren Singh

arXiv:2006.16212·cs.CL·June 30, 2020

Towards the Study of Morphological Processing of the Tangkhul Language

Mirinso Shadang, Navanath Saharia, Thoudam Doren Singh

PDF

Open Access

TL;DR

This paper initiates the study of morphological processing for the Tangkhul language using an unsupervised approach, demonstrating promising results with a small corpus for morpheme identification.

Contribution

It presents the first attempt at morphological processing of Tangkhul language employing an unsupervised method with a small corpus.

Findings

01

Morpheme identification yields reasonable results

02

Unsupervised approach is effective with limited data

03

Provides a foundation for future NLP work on Tangkhul

Abstract

There is no or little work on natural language processing of Tangkhul language. The current work is a humble beginning of morphological processing of this language using an unsupervised approach. We use a small corpus collected from different sources of text books, short stories and articles of other topics. Based on the experiments carried out, the morpheme identification task using morphessor gives reasonable and interesting output despite using a small corpus.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution