Analysis and Mortality Prediction using Multiclass Classification for Older Adults with Type 2 Diabetes
Ruchika Desure, Gutha Jaya Krishna

TL;DR
This study explores multiclass classification for predicting mortality in older adults with Type 2 Diabetes using various models, but finds that current approaches yield limited accuracy, highlighting challenges in high-dimensional data and feature interactions.
Contribution
It introduces a novel multiclass classification approach to mortality prediction in older diabetics, addressing data preprocessing and feature selection challenges.
Findings
XGBoost achieved 53.03% accuracy with Chi-Squared feature selection.
Models performed best for patients with over 10 years remaining life.
High dimensionality caused model confusion and misclassification.
Abstract
Designing proper treatment plans to manage diabetes requires health practitioners to pay heed to the individuals remaining life along with the comorbidities affecting them. Older adults with Type 2 Diabetes Mellitus (T2DM) are prone to experience premature death or even hypoglycaemia. The structured dataset utilized has 68 potential mortality predictors for 275,190 diabetic U.S. military Veterans aged 65 years or older. A new target variable is invented by combining the two original target variables. Outliers are handled by discretizing the continuous variables. Categorical variables have been dummy encoded. Class balancing is achieved by random under-sampling. A benchmark regression model is built using Multinomial Logistic Regression with LASSO. Chi-Squared and Information Gain are the filter-based feature selection techniques utilized. Classifiers such as Multinomial Logistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare
MethodsLogistic Regression · Feature Selection
