Unlocking the Potential of Multiple BERT Models for Bangla Question Answering in NCTB Textbooks
Abdullah Khondoker, Enam Ahmed Taufik, Md Iftekhar Islam Tashik, S M, Ishtiak mahmud, Antara Firoz Parsa

TL;DR
This study evaluates the effectiveness of multiple BERT-based models in automatically assessing Bangla passage-based questions from textbooks, highlighting Bangla-BERT's superior performance and the importance of hyperparameter tuning.
Contribution
It introduces a comparative analysis of state-of-the-art Bangla language models for question answering in educational texts, emphasizing model performance and hyperparameter effects.
Findings
Bangla-BERT outperforms RoBERTa and BERT in F1 and EM scores.
Hyperparameter tuning significantly impacts model accuracy.
Limitations include dataset size and spelling inconsistencies.
Abstract
Evaluating text comprehension in educational settings is critical for understanding student performance and improving curricular effectiveness. This study investigates the capability of state-of-the-art language models-RoBERTa Base, Bangla-BERT, and BERT Base-in automatically assessing Bangla passage-based question-answering from the National Curriculum and Textbook Board (NCTB) textbooks for classes 6-10. A dataset of approximately 3,000 Bangla passage-based question-answering instances was compiled, and the models were evaluated using F1 Score and Exact Match (EM) metrics across various hyperparameter configurations. Our findings revealed that Bangla-BERT consistently outperformed the other models, achieving the highest F1 (0.75) and EM (0.53) scores, particularly with smaller batch sizes, the inclusion of stop words, and a moderate learning rate. In contrast, RoBERTa Base…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Assessment and Pedagogy
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Adam · Weight Decay · Multi-Head Attention · RoBERTa · Layer Normalization · WordPiece
