Predicting blood pressure under circumstances of missing data: An analysis of missing data patterns and imputation methods using NHANES
Harish Chauhan, Nikunj Gupta, Zoe Haskell-Craig

TL;DR
This study evaluates various imputation methods for handling missing data in NHANES to improve blood pressure prediction accuracy based on diet and activity data.
Contribution
It systematically compares imputation techniques under different missing data patterns and applies the best methods to enhance blood pressure prediction models.
Findings
Imputation methods significantly affect blood pressure prediction accuracy.
Certain imputation techniques outperform others depending on missing data patterns.
Complete case analysis often reduces data utility and accuracy.
Abstract
The World Health Organization defines cardio-vascular disease (CVD) as "a group of disorders of the heart and blood vessels," including coronary heart disease and stroke (WHO 21). CVD is affected by "intermediate risk factors" such as raised blood pressure, raised blood glucose, raised blood lipids, and obesity. These are predominantly influenced by lifestyle and behaviour, including physical inactivity, unhealthy diets, high intake of salt, and tobacco and alcohol use. However, genetics and social/environmental factors such as poverty, stress, and racism also play an important role. Researchers studying the behavioural and environmental factors associated with these "intermediate risk factors" need access to high quality and detailed information on diet and physical activity. However, missing data are a pervasive problem in clinical and public health research, affecting both randomized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsdemographic modeling and climate adaptation · Health disparities and outcomes · Statistical Methods and Bayesian Inference
