Transformer Based Bengali Chatbot Using General Knowledge Dataset
Abu Kaisar Mohammad Masum, Sheikh Abujar, Sharmin Akter, Nushrat Jahan, Ria, Syed Akhter Hossain

TL;DR
This paper develops a Bengali chatbot using transformer architecture trained on a general knowledge dataset, achieving high BLEU scores and outperforming traditional seq2seq models with attention.
Contribution
It introduces a transformer-based approach for Bengali chatbots and demonstrates significant performance improvements over seq2seq models on a general knowledge dataset.
Findings
Transformer model scored 85.0 BLEU on Bengali QA dataset.
Seq2seq with attention scored 23.5 BLEU, showing transformer superiority.
Transformer reduces training time compared to RNN-based models.
Abstract
An AI chatbot provides an impressive response after learning from the trained dataset. In this decade, most of the research work demonstrates that deep neural models superior to any other model. RNN model regularly used for determining the sequence-related problem like a question and it answers. This approach acquainted with everyone as seq2seq learning. In a seq2seq model mechanism, it has encoder and decoder. The encoder embedded any input sequence, and the decoder embedded output sequence. For reinforcing the seq2seq model performance, attention mechanism added into the encoder and decoder. After that, the transformer model has introduced itself as a high-performance model with multiple attention mechanism for solving the sequence-related dilemma. This model reduces training time compared with RNN based model and also achieved state-of-the-art performance for sequence transduction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
