Automatic Segmentation of Manipuri (Meiteilon) Word into Syllabic Units
Kishorjit Nongmeikapam, Vidya Raj RK, Oinam Imocha Singh, Sivaji Bandyopadhyay

TL;DR
This paper presents an algorithm for automatic syllable segmentation of Manipuri words written in Meitei Mayek script. It reports a recall of 74.77%, a precision of 91.21%, and an F-score of 82.18%, which the authors describe as reasonable for a first attempt of this kind for the language.
Contribution
The paper introduces an algorithm that segments Manipuri words in Meitei Mayek script into syllabic units, described by the authors as the first attempt of its kind for this language. The algorithm mainly targets words of Manipuri origin.
Findings
Recall of 74.77%
Precision of 91.21%
F-Score of 82.18%
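The three figures above are mutually consistent if the F-score is the balanced harmonic mean of precision and recall (F1). The source does not state which F-measure was used, so that is an assumption here. The sketch below simply recomputes the F-score from the reported precision and recall; the function name `f_score` is illustrative and does not come from the paper.

```python
def f_score(precision: float, recall: float) -> float:
    """Balanced F-score: the harmonic mean of precision and recall.

    Assumption: the paper reports the balanced (F1) measure.
    Inputs and output are on the same scale (here, percentages).
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both values are zero
    return 2 * precision * recall / (precision + recall)


# Values reported in the Findings list above (percentages).
reported_precision = 91.21
reported_recall = 74.77

computed = f_score(reported_precision, reported_recall)
print(round(computed, 2))  # 82.18, matching the reported F-score
```

Because the harmonic mean is pulled toward the smaller of the two values, the F-score sits closer to the recall of 74.77% than a simple average would.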
Abstract
This paper demonstrates the automatic segmentation of Manipuri (Meiteilon) words into syllabic units. Manipuri is a scheduled Indian language of Tibeto-Burman origin and is highly agglutinative. The language uses two scripts: Bengali script and Meitei Mayek. The present work is based on the latter. An algorithm is designed to identify mainly the syllables of words of Manipuri origin. The algorithm achieves a recall of 74.77, a precision of 91.21, and an F-score of 82.18, a reasonable result for a first attempt of this kind for the language.
Taxonomy
Topics: Natural Language Processing Techniques · South Asian Studies and Conflicts · Language, Linguistics, Cultural Analysis
