Predicting Heart Disease and Reducing Survey Time Using Machine Learning Algorithms
Salahaldeen Rababa, Asma Yamin, Shuxia Lu, Ashraf Obaidat

TL;DR
This paper explores machine learning techniques to predict heart disease from CDC survey data, achieving high accuracy and reducing survey time by 77%, thus enhancing early diagnosis and efficiency.
Contribution
It demonstrates the effectiveness of feature selection and stability analysis in improving heart disease prediction accuracy and reducing survey length.
Findings
Survey data can predict heart disease with up to 80% accuracy.
Survey time can be reduced by 77% without losing predictive performance.
Machine learning models outperform traditional diagnostic methods.
Abstract
Currently, many researchers and analysts are working toward medical diagnosis enhancement for various diseases. Heart disease is one of the common diseases that can be considered a significant cause of mortality worldwide. Early detection of heart disease significantly helps in reducing the risk of heart failure. Consequently, the Centers for Disease Control and Prevention (CDC) conducts a health-related telephone survey yearly from over 400,000 participants. However, several concerns arise regarding the reliability of the data in predicting heart disease and whether all of the survey questions are strongly related. This study aims to utilize several machine learning techniques, such as support vector machines and logistic regression, to investigate the accuracy of the CDC's heart disease survey in the United States. Furthermore, we use various feature selection methods to identify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
MethodsFeature Selection
