Machine Learning and Data Science approach towards trend and predictors analysis of CDC Mortality Data for the USA
Yasir Nadeem, Awais Ahmed

TL;DR
This study applies machine learning and data science techniques to analyze CDC mortality data for the USA, uncovering factors like life expectancy and marital status, while highlighting challenges in prediction accuracy due to data complexity.
Contribution
It demonstrates the application of ML and data science methods to mortality data analysis and discusses the limitations of predictive modeling in this context.
Findings
Life expectancy varies across genders and marital statuses.
Anomaly detection and under-sampling are useful for handling class imbalance.
ML predictions face challenges due to data complexity.
Abstract
The research on mortality is an active area of research for any country where the conclusions are driven from the provided data and conditions. The domain knowledge is an essential but not a mandatory skill (though some knowledge is still required) in order to derive conclusions based on data intuition using machine learning and data science practices. The purpose of conducting this project was to derive conclusions based on the statistics from the provided dataset and predict label(s) of the dataset using supervised or unsupervised learning algorithms. The study concluded (based on a sample) life expectancy regardless of gender, and their central tendencies; Marital status of the people also affected how frequent deaths were for each of them. The study also helped in finding out that due to more categorical and numerical data, anomaly detection or under-sampling could be a viable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Insurance, Mortality, Demography, Risk Management · Health, Environment, Cognitive Aging
