Optimizing Mortality Prediction for ICU Heart Failure Patients: Leveraging XGBoost and Advanced Machine Learning with the MIMIC-III Database
Negin Ashrafi, Armin Abdollahi, Jiahong Zhang, Maryam Pishgar

TL;DR
This study develops an advanced machine learning model using XGBoost and comprehensive feature selection to accurately predict mortality in ICU heart failure patients, outperforming previous models and providing clinical insights.
Contribution
The paper introduces a novel predictive framework combining advanced feature selection, preprocessing, and hyperparameter tuning with XGBoost for ICU heart failure mortality prediction.
Findings
XGBoost achieved a test AUC-ROC of 0.9228, outperforming previous models.
Key features like leucocyte count and RDW were identified as significant predictors.
The framework enhances clinical decision-making for high-risk ICU heart failure patients.
Abstract
Heart failure affects millions of people worldwide, significantly reducing quality of life and leading to high mortality rates. Despite extensive research, the relationship between heart failure and mortality rates among ICU patients is not fully understood, indicating the need for more accurate prediction models. This study analyzed data from 1,177 patients over 18 years old from the MIMIC-III database, identified using ICD-9 codes. Preprocessing steps included handling missing data, removing duplicates, treating skewness, and using oversampling techniques to address data imbalances. Through rigorous feature selection using Variance Inflation Factor (VIF), expert clinical input, and ablation studies, 46 key features were identified to enhance model performance. Our analysis compared several machine learning models, including Logistic Regression, Support Vector Machine (SVM), Random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare
MethodsShapley Additive Explanations · Feature Selection · Logistic Regression
