A Comprehensive Machine Learning Framework for Heart Disease Prediction: Performance Evaluation and Future Perspectives
Ali Azimi Lamir, Shiva Razzagzadeh, Zeynab Rezaei

TL;DR
This paper develops and evaluates a machine learning framework for heart disease prediction, demonstrating high accuracy with Random Forest and discussing future improvements with larger datasets.
Contribution
It introduces a comprehensive ML framework with hyperparameter tuning for heart disease prediction, highlighting the effectiveness of Random Forest over other classifiers.
Findings
Random Forest achieved 91% accuracy
Model showed balanced performance across metrics
Highlights need for larger datasets for better generalization
Abstract
This study presents a machine learning-based framework for heart disease prediction using the heart-disease dataset, comprising 303 samples with 14 features. The methodology involves data preprocessing, model training, and evaluation using three classifiers: Logistic Regression, K-Nearest Neighbors (KNN), and Random Forest. Hyperparameter tuning with GridSearchCV and RandomizedSearchCV was employed to enhance model performance. The Random Forest classifier outperformed other models, achieving an accuracy of 91% and an F1-score of 0.89. Evaluation metrics, including precision, recall, and confusion matrix, revealed balanced performance across classes. The proposed model demonstrates strong potential for aiding clinical decision-making by effectively predicting heart disease. Limitations such as dataset size and generalizability underscore the need for future studies using larger and more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
MethodsLogistic Regression
