A Subword Level Language Model for Bangla Language
Aisha Khatun, Anisur Rahman, Hemayet Ahmed Chowdhury, Md. Saiful Islam, and Ayesha Tasnim

TL;DR
This paper introduces a subword level neural language model for Bangla that significantly reduces perplexity compared to previous models, enhancing NLP applications for the language.
Contribution
It presents a novel subword level neural language model tailored for Bangla, addressing resource scarcity and high perplexity issues in existing models.
Findings
Perplexity reduced to 39.84 after 20 epochs
Model trained on 28.5 million Bangla tokens
Significant performance improvement over previous models
Abstract
Language models are at the core of natural language processing. The ability to represent natural language gives rise to its applications in numerous NLP tasks including text classification, summarization, and translation. Research in this area is very limited in Bangla due to the scarcity of resources, except for some count-based models and very recent neural language models being proposed, which are all based on words and limited in practical tasks due to their high perplexity. This paper attempts to approach this issue of perplexity and proposes a subword level neural language model with the AWD-LSTM architecture and various other techniques suitable for training in Bangla language. The model is trained on a corpus of Bangla newspaper articles of an appreciable size consisting of more than 28.5 million word tokens. The performance comparison with various other models depicts the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDropout · Sigmoid Activation · Tanh Activation · Temporal Activation Regularization · DropConnect · Long Short-Term Memory · Activation Regularization · Embedding Dropout · Non-monotonically Triggered ASGD · Variational Dropout
