PLUTO: Penalized Unbiased Logistic Regression Trees
Wenwen Zhang, Wei-Yin Loh

TL;DR
PLUTO is a novel algorithm for constructing logistic regression trees that effectively captures nonlinearities and interactions in binary data, using elastic net penalties and bias control techniques for improved prediction accuracy.
Contribution
It introduces a new method combining recursive partitioning with penalized logistic regression models, addressing high-dimensionality and bias in split variable selection.
Findings
PLUTO outperforms existing algorithms in predictive accuracy on real datasets.
It efficiently handles high-dimensional data with elastic net regularization.
The method provides both classification and probability estimation.
Abstract
We propose a new algorithm called PLUTO for building logistic regression trees to binary response data. PLUTO can capture the nonlinear and interaction patterns in messy data by recursively partitioning the sample space. It fits a simple or a multiple linear logistic regression model in each partition. PLUTO employs the cyclical coordinate descent method for estimation of multiple linear logistic regression models with elastic net penalties, which allows it to deal with high-dimensional data efficiently. The tree structure comprises a graphical description of the data. Together with the logistic regression models, it provides an accurate classifier as well as a piecewise smooth estimate of the probability of "success". PLUTO controls selection bias by: (1) separating split variable selection from split point selection; (2) applying an adjusted chi-squared test to find the split variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Data Analysis with R
MethodsLogistic Regression
