Explainable detection: a transformer-based language modeling approach for Bengali news title classification with comparative explainability analysis using ML and DL
Md. Julkar Naeen, Sourav Kumar Das, Sakib Alam Jisan, Sharun Akter Khushbu, Noyon Chandra Saha, Ohidujjaman

TL;DR
This paper explores using transformer models for classifying Bengali news titles, comparing them with ML and LSTM models, and emphasizes explainability in AI for low-resource languages.
Contribution
The study introduces transformer-based models for Bengali text classification and integrates explainable AI techniques to improve transparency.
Findings
XLM-RoBERTa Base achieved the highest accuracy of 0.91 in classifying Bengali news titles.
Explainable AI techniques like LIME were used to identify key features influencing classification outcomes.
Transformer models outperformed traditional ML and LSTM models in Bengali text classification.
Abstract
Classifying scattered Bengali text is the primary focus of this study, with an emphasis on explainability in Natural Language Processing (NLP) for low-resource languages. We employed supervised Machine Learning (ML) models as a baseline and compared their performance with Long Short-Term Memory (LSTM) networks from the deep learning domain. Subsequently, we implemented transformer models designed for sequential learning. To prepare the dataset, we collected recent Bengali news articles online and performed extensive feature engineering. Given the inherent noise in Bengali datasets, significant preprocessing was required. Among the models tested, XLM-RoBERTa Base achieved the highest accuracy 0.91. Furthermore, we integrated explainable AI techniques to interpret the model’s predictions, enhancing transparency and fostering trust in the classification outcomes. Additionally, we employed…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Computational and Text Analysis Methods
