Improving Bangla Linguistics: Advanced LSTM, Bi-LSTM, and Seq2Seq Models for Translating Sylheti to Modern Bangla
Sourav Kumar Das, Md. Julkar Naeen, MD. Jahidul Islam, Md. Anisul Haque Sajeeb, Narayan Ranjan Chakraborty, Mayen Uddin Mojumdar

TL;DR
This paper develops and compares advanced LSTM-based models for translating Modern Bangla into Sylheti, a regional dialect, demonstrating high accuracy and contributing to Bangla NLP research.
Contribution
It introduces a comprehensive NLP translation system using LSTM, Bi-LSTM, and Seq2Seq models specifically for Sylheti to Bangla translation, with LSTM achieving 89.3% accuracy.
Findings
LSTM model achieved 89.3% accuracy
Seq2Seq and Bi-LSTM models were also evaluated
The research supports further development in Bangla NLP
Abstract
Bangla or Bengali is the national language of Bangladesh, people from different regions don't talk in proper Bangla. Every division of Bangladesh has its own local language like Sylheti, Chittagong etc. In recent years some papers were published on Bangla language like sentiment analysis, fake news detection and classifications, but a few of them were on Bangla languages. This research is for the local language and this particular paper is on Sylheti language. It presented a comprehensive system using Natural Language Processing or NLP techniques for translating Pure or Modern Bangla to locally spoken Sylheti Bangla language. Total 1200 data used for training 3 models LSTM, Bi-LSTM and Seq2Seq and LSTM scored the best in performance with 89.3% accuracy. The findings of this research may contribute to the growth of Bangla NLP researchers for future more advanced innovations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSequence to Sequence · Tanh Activation · Sigmoid Activation · Long Short-Term Memory
