A Precision Environment-Wide Association Study of Hypertension via Supervised Cadre Models
Alexander New, Kristin P. Bennett

TL;DR
This paper introduces an extended supervised cadre model (SCM) to identify subpopulations with distinct environmental risk factors for hypertension, revealing both known and novel associations in a complex survey setting.
Contribution
It extends the SCM framework to multivariate and binary data, enabling environment-wide association studies to uncover heterogeneity in health risk factors.
Findings
Identified 25 environmental factors significantly associated with blood pressure and hypertension.
Discovered subpopulations with unique risk profiles not evident in the overall population.
Validated the interpretability and potential for further research of the identified subpopulations.
Abstract
We consider the problem in precision health of grouping people into subpopulations based on their degree of vulnerability to a risk factor. These subpopulations cannot be discovered with traditional clustering techniques because their quality is evaluated with a supervised metric: the ease of modeling a response variable over observations within them. Instead, we apply the supervised cadre model (SCM), which does use this metric. We extend the SCM formalism so that it may be applied to multivariate regression and binary classification problems. We also develop a way to use conditional entropy to assess the confidence in the process by which a subject is assigned their cadre. Using the SCM, we generalize the environment-wide association study (EWAS) workflow to be able to model heterogeneity in population risk. In our EWAS, we consider more than two hundred environmental exposure factors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
