Automated Word Prediction in Bangla Language Using Stochastic Language   Models

Md. Masudul Haque; Md. Tarek Habib; Md. Mokhlesur Rahman

arXiv:1602.07803·cs.CL·February 26, 2016

Automated Word Prediction in Bangla Language Using Stochastic Language Models

Md. Masudul Haque, Md. Tarek Habib, Md. Mokhlesur Rahman

PDF

TL;DR

This paper explores the use of stochastic N-gram language models for automated word prediction in Bangla, aiming to improve typing efficiency and accuracy, especially for users with disabilities.

Contribution

It introduces the application of various N-gram models to Bangla word prediction, providing a baseline for automated Bangla typing systems.

Findings

01

High prediction accuracy achieved with N-gram models

02

Effective auto-completion reduces keystrokes and misspellings

03

Promising results suggest potential for real-world application

Abstract

Word completion and word prediction are two important phenomena in typing that benefit users who type using keyboard or other similar devices. They can have profound impact on the typing of disable people. Our work is based on word prediction on Bangla sentence by using stochastic, i.e. N-gram language model such as unigram, bigram, trigram, deleted Interpolation and backoff models for auto completing a sentence by predicting a correct word in a sentence which saves time and keystrokes of typing and also reduces misspelling. We use large data corpus of Bangla language of different word types to predict correct word with the accuracy as much as possible. We have found promising results. We hope that our work will impact on the baseline for automated Bangla typing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.