HMLA: A hybrid machine learning approach for enhancing stroke prediction models with missing data imputation techniques
M. Sheetal Singh, Khelchandra Thongam, Krishna Kumar, Prakash Choudhary

TL;DR
This paper introduces a hybrid machine learning method combining feature selection and missing data imputation to improve stroke prediction accuracy in clinical datasets.
Contribution
The novel hybrid IGR–KNN–DNN framework enhances stroke prediction with missing data handling and feature selection.
Findings
The model achieved 94.32% accuracy and 95.96% precision in stroke prediction.
The framework showed high computational efficiency and strong predictive performance compared to classical methods.
Results suggest the need for external validation to confirm clinical applicability.
Abstract
Early and accurate stroke prediction is critical to reduce death and disability risk, despite the presence of irrelevant and sparse information in clinical datasets that often undermines model performance. The novel machine learning approach is proposed for stroke prediction in the Cardiovascular Health Study (CHS) dataset. The proposed approach consists of two steps. The important features are selected using the Information Gain Ratio (IGR) during the preprocessing, and missing data handled by K-Nearest Neighbour (KNN), which also helps to enhance data integrity as well computing efficiency. Following the classification phase, a Deep Neural Network (DNN) model is trained on the preprocessed information to predict stroke risk. After classification, a DNN model is further trained using preprocessed data to predict the risk of stroke. Model assessment was based on a combined 10-fold…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcute Ischemic Stroke Management · Artificial Intelligence in Healthcare · Machine Learning in Healthcare
