Classification Trees with Valid Inference via the Exponential Mechanism
Soham Bakshi, Snigdha Panigrahi

TL;DR
This paper introduces a probabilistic tree-fitting method using the exponential mechanism, enabling valid inference on classification trees while maintaining predictive accuracy.
Contribution
It proposes a novel, non-greedy tree-fitting approach that facilitates inference by incorporating the exponential mechanism into the splitting process.
Findings
Method achieves asymptotically valid inference on tree parameters.
It maintains predictive accuracy comparable to standard algorithms.
Enables inference without data splitting, unlike traditional methods.
Abstract
Decision trees are widely used for non-linear modeling, as they capture interactions between predictors while producing inherently interpretable models. Despite their popularity, performing inference on the non-linear fit remains largely unaddressed. This paper focuses on classification trees and makes two key contributions. First, we introduce a novel tree-fitting method that replaces the greedy splitting of the predictor space in standard tree algorithms with a probabilistic approach. Each split in our approach is selected according to sampling probabilities defined by an exponential mechanism, with a temperature parameter controlling its deviation from the deterministic choice given data. Second, while our approach can fit a tree that with high probability coincides with the fit produced by standard tree algorithms at low temperatures, it is not merely predictive; unlike standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
