Oblique Bayesian additive regression trees
Paul-Hieu V. Nguyen, Ryan Yee, Sameer K. Deshpande

TL;DR
This paper introduces an oblique Bayesian Additive Regression Trees (BART) model that uses linear combinations of features for decision rules, improving prediction accuracy over traditional axis-aligned BART in various datasets.
Contribution
The paper develops an oblique BART model with a data-adaptive prior, enabling recursive partitioning along hyperplanes, which enhances predictive performance.
Findings
Oblique BART is competitive with existing methods.
Oblique BART often outperforms axis-aligned BART.
The method shows strong results on synthetic and real datasets.
Abstract
Current implementations of Bayesian Additive Regression Trees (BART) are based on axis-aligned decision rules that recursively partition the feature space using a single feature at a time. Several authors have demonstrated that oblique trees, whose decision rules are based on linear combinations of features, can sometimes yield better predictions than axis-aligned trees and exhibit excellent theoretical properties. We develop an oblique version of BART that leverages a data-adaptive decision rule prior that recursively partitions the feature space along random hyperplanes. Using several synthetic and real-world benchmark datasets, we systematically compared our oblique BART implementation to axis-aligned BART and other tree ensemble methods, finding that oblique BART was competitive with -- and sometimes much better than -- those methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Dense Connections · Layer Normalization · Adam · Multi-Head Attention · Residual Connection · Softmax
