TL;DR
This study compares various statistical methods for identifying and estimating non-linear, correlated chemical effects on health, highlighting the strengths of Bayesian approaches over linear models in complex exposure scenarios.
Contribution
It provides a comprehensive simulation-based evaluation of Bayesian kernel machine regression, BART, BSTARSS, and lasso in modeling non-monotonic chemical exposure effects.
Findings
BKMR and BSTARSS showed high specificity and sensitivity in most scenarios.
Lasso performed well in sensitivity but poorly with non-monotonic relationships.
Performance was influenced by signal-to-noise ratio, not correlation structure.
Abstract
Statistical methods for identifying harmful chemicals in a correlated mixture often assume linearity in exposure-response relationships. Non-monotonic relationships are increasingly recognised (e.g., for endocrine-disrupting chemicals); however, the impact of non-monotonicity on exposure selection has not been evaluated. In a simulation study, we assessed the performance of Bayesian kernel machine regression (BKMR), Bayesian additive regression trees (BART), Bayesian structured additive regression with spike-slab priors (BSTARSS), and lasso penalised regression. We used data on exposure to 12 phthalates and phenols in pregnant women from the U.S. National Health and Nutrition Examination Survey to simulate realistic exposure data using a multivariate copula. We simulated datasets of size N = 250 and compared methods across 32 scenarios, varying by model size and sparsity,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
