Towards the Study of Morphological Processing of the Tangkhul Language
Mirinso Shadang, Navanath Saharia, Thoudam Doren Singh

TL;DR
This paper initiates the study of morphological processing for the Tangkhul language using an unsupervised approach, demonstrating promising results with a small corpus for morpheme identification.
Contribution
It presents the first attempt at morphological processing of Tangkhul language employing an unsupervised method with a small corpus.
Findings
Morpheme identification yields reasonable results
Unsupervised approach is effective with limited data
Provides a foundation for future NLP work on Tangkhul
Abstract
There is no or little work on natural language processing of Tangkhul language. The current work is a humble beginning of morphological processing of this language using an unsupervised approach. We use a small corpus collected from different sources of text books, short stories and articles of other topics. Based on the experiments carried out, the morpheme identification task using morphessor gives reasonable and interesting output despite using a small corpus.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Language and cultural evolution
