Accounting for shared covariates in semi-parametric Bayesian additive regression trees
Estev\~ao B. Prado, Andrew C. Parnell, Keefe Murphy, Nathan McJames, Ann O'Shea, Rafael A. Moral

TL;DR
This paper introduces an extension to semi-parametric Bayesian additive regression trees (BART) that allows shared covariates between the linear and tree components, improving modeling of complex interactions and interpretability.
Contribution
We develop a novel BART tree-generation method to handle shared covariates, addressing bias and non-identifiability in semi-parametric models with overlapping predictors.
Findings
Competitive performance on benchmark datasets
Improved modeling of covariate interactions
Enhanced interpretability of predictors in semi-parametric BART
Abstract
We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models based on BART have assumed that the set of covariates in the linear predictor and the BART model are mutually exclusive in an attempt to avoid poor coverage properties and reduce bias in the estimates of the parameters in the linear predictor. The main novelty in our approach lies in the way we change the tree-generation moves in BART to deal with this bias and resolve non-identifiability issues between the parametric and non-parametric components, even when they have covariates in common. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference
Methods{Quick~24x7} Seven Proven Methods to Fix (QuickBooks Payroll Error by Reaching Phone · Multi-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Dropout · Dense Connections · Adam · Byte Pair Encoding · Softmax
