Automatic Segmentation of Manipuri (Meiteilon) Word into Syllabic Units

Kishorjit Nongmeikapam; Vidya Raj RK; Oinam Imocha Singh; Sivaji; Bandyopadhyay

arXiv:1207.3932·cs.CL·July 18, 2012

Automatic Segmentation of Manipuri (Meiteilon) Word into Syllabic Units

Kishorjit Nongmeikapam, Vidya Raj RK, Oinam Imocha Singh, Sivaji, Bandyopadhyay

PDF

Open Access

TL;DR

This paper presents an algorithm for automatic syllable segmentation of Manipuri words in Meitei Mayek script, achieving promising accuracy metrics for the first such attempt for this language.

Contribution

The paper introduces the first algorithm for syllable segmentation of Manipuri words in Meitei Mayek script, demonstrating effective performance metrics.

Findings

01

Recall of 74.77%

02

Precision of 91.21%

03

F-Score of 82.18%

Abstract

The work of automatic segmentation of a Manipuri language (or Meiteilon) word into syllabic units is demonstrated in this paper. This language is a scheduled Indian language of Tibeto-Burman origin, which is also a very highly agglutinative language. This language usages two script: a Bengali script and Meitei Mayek (Script). The present work is based on the second script. An algorithm is designed so as to identify mainly the syllables of Manipuri origin word. The result of the algorithm shows a Recall of 74.77, Precision of 91.21 and F-Score of 82.18 which is a reasonable score with the first attempt of such kind for this language.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · South Asian Studies and Conflicts · Language, Linguistics, Cultural Analysis