Bridging the Gap in Bangla Healthcare: Machine Learning Based Disease Prediction Using a Symptoms-Disease Dataset
Rowzatul Zannat, Abdullah Al Shafi, Abdul Muntakim

TL;DR
This paper introduces a new Bangla symptoms-disease dataset and evaluates machine learning models for disease prediction, achieving 98% accuracy to improve healthcare access for Bengali speakers.
Contribution
The study provides the first comprehensive Bangla symptoms-disease dataset and demonstrates high-accuracy disease prediction models tailored for Bangla-speaking populations.
Findings
Achieved 98% accuracy with ensemble models
Developed and publicly released a dataset of 758 symptom-disease pairs
Enhanced disease prediction capabilities for Bangla speakers
Abstract
Increased access to reliable health information is essential for non-English-speaking populations, yet resources in Bangla for disease prediction remain limited. This study addresses this gap by developing a comprehensive Bangla symptoms-disease dataset containing 758 unique symptom-disease relationships spanning 85 diseases. To ensure transparency and reproducibility, we also make our dataset publicly available. The dataset enables the prediction of diseases based on Bangla symptom inputs, supporting healthcare accessibility for Bengali-speaking populations. Using this dataset, we evaluated multiple machine learning models to predict diseases based on symptoms provided in Bangla and analyzed their performance on our dataset. Both soft and hard voting ensemble approaches combining top-performing models achieved 98\% accuracy, demonstrating superior robustness and generalization. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Genomics and Rare Diseases · Artificial Intelligence in Healthcare and Education
