A Comprehensive Machine Learning Framework for Heart Disease Prediction: Performance Evaluation and Future Perspectives

Ali Azimi Lamir; Shiva Razzagzadeh; Zeynab Rezaei

arXiv:2505.09969·cs.LG·May 16, 2025

A Comprehensive Machine Learning Framework for Heart Disease Prediction: Performance Evaluation and Future Perspectives

Ali Azimi Lamir, Shiva Razzagzadeh, Zeynab Rezaei

PDF

Open Access

TL;DR

This paper develops and evaluates a machine learning framework for heart disease prediction, demonstrating high accuracy with Random Forest and discussing future improvements with larger datasets.

Contribution

It introduces a comprehensive ML framework with hyperparameter tuning for heart disease prediction, highlighting the effectiveness of Random Forest over other classifiers.

Findings

01

Random Forest achieved 91% accuracy

02

Model showed balanced performance across metrics

03

Highlights need for larger datasets for better generalization

Abstract

This study presents a machine learning-based framework for heart disease prediction using the heart-disease dataset, comprising 303 samples with 14 features. The methodology involves data preprocessing, model training, and evaluation using three classifiers: Logistic Regression, K-Nearest Neighbors (KNN), and Random Forest. Hyperparameter tuning with GridSearchCV and RandomizedSearchCV was employed to enhance model performance. The Random Forest classifier outperformed other models, achieving an accuracy of 91% and an F1-score of 0.89. Evaluation metrics, including precision, recall, and confusion matrix, revealed balanced performance across classes. The proposed model demonstrates strong potential for aiding clinical decision-making by effectively predicting heart disease. Limitations such as dataset size and generalizability underscore the need for future studies using larger and more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare

MethodsLogistic Regression