Discovery and inference beyond linearity by integrating Bayesian regression, tree ensembles and Shapley values

Giorgio Spadaccini; Marjolein Fokkema; Mark A. van de Wiel

arXiv:2505.00571·stat.ML·January 1, 2026

Discovery and inference beyond linearity by integrating Bayesian regression, tree ensembles and Shapley values

Giorgio Spadaccini, Marjolein Fokkema, Mark A. van de Wiel

PDF

1 Repo

TL;DR

RuleSHAP is a novel framework that combines Bayesian regression, tree rules, and Shapley values to detect nonlinear effects and interactions in healthcare data with reliable uncertainty quantification.

Contribution

It introduces RuleSHAP, a new method integrating Bayesian sparse regression and tree-based rules to enable valid inference of feature effects in complex models.

Findings

01

Successfully detects nonlinear and interaction effects in simulated data.

02

Identifies significant effects in epidemiological cohort data.

03

Provides uncertainty quantification for individual feature effects.

Abstract

Machine Learning (ML) is gaining popularity for hypothesis-free discovery of risk and protective factors in healthcare studies. ML is strong at discovering nonlinearities and interactions, but this power is compromised by a lack of reliable inference. Although Shapley values provide local measures of features' effects, valid uncertainty quantification for these effects is typically lacking, thus precluding statistical inference. We propose RuleSHAP, a framework that addresses this limitation by combining a dedicated Bayesian sparse regression model with a new tree-based rule generator and Shapley value attribution. RuleSHAP provides detection of nonlinear and interaction effects with uncertainty quantification at the individual level. We derive an efficient formula for computing marginal Shapley values within this framework. We demonstrate the validity of our framework on simulated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GiorgioSpadaccini/ruleSHAP
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.