Semi-Parametric Bayesian Additive Regression Trees for Risk Prediction with High-Dimensional Epigenetic Signatures and Low-Dimensional Covariates
Saurabh Bhandari, Brian C.-H. Chiu, Parveen Bhatti, Yuan Ji

TL;DR
This paper introduces spBART, a semi-parametric Bayesian model that combines interpretable covariate effects with flexible high-dimensional predictor modeling, demonstrated on epigenetic data for risk prediction.
Contribution
The paper develops a novel semi-parametric BART model that separates covariate effects from high-dimensional predictors, enabling interpretability and effective variable selection.
Findings
Achieved high predictive accuracy with AUC=0.96 in multiple myeloma data.
Identified a parsimonious set of candidate loci associated with risk.
Provided a stable variable selection procedure using cross-validation and FDR control.
Abstract
In the era of precision medicine, genome-wide epigenetic modifications offer rich data that could inform risk prediction. However, these data are high-dimensional and exhibit complex dependence structures, which makes it difficult to jointly model them with low-dimensional covariates when the goal is to obtain interpretable effect estimates for covariate adjustment. Standard Bayesian additive regression trees (BART) provide strong predictive performance but treat all predictors uniformly within the tree ensemble, obscuring the contributions of significant covariates and complicating variable selection in high-dimensional settings. We propose a semi-parametric BART model (spBART) that addresses this limitation by modeling low-dimensional covariates through a parametric component with interpretable coefficients, while capturing complex nonlinear associations among high-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
