Generalization Properties of Decision Trees on Real-valued and Categorical Features
Jean-Samuel Leboeuf, Fr\'ed\'eric LeBlanc, Mario Marchand

TL;DR
This paper analyzes the generalization capabilities of decision trees with different feature types, deriving VC dimension bounds and proposing an effective pruning algorithm that outperforms traditional methods without cross-validation.
Contribution
It introduces the notion of partitioning function for decision trees, derives exact VC dimension for decision stumps, and provides bounds for general trees, along with a novel pruning algorithm.
Findings
Exact VC dimension of decision stumps on real-valued features derived
VC dimension of binary trees with L_T leaves is in O(L_T log(L_T )
Proposed pruning algorithm outperforms traditional methods without cross-validation
Abstract
We revisit binary decision trees from the perspective of partitions of the data. We introduce the notion of partitioning function, and we relate it to the growth function and to the VC dimension. We consider three types of features: real-valued, categorical ordinal and categorical nominal, with different split rules for each. For each feature type, we upper bound the partitioning function of the class of decision stumps before extending the bounds to the class of general decision tree (of any fixed structure) using a recursive approach. Using these new results, we are able to find the exact VC dimension of decision stumps on examples of real-valued features, which is given by the largest integer such that . Furthermore, we show that the VC dimension of a binary tree structure with leaves on examples of real-valued…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Bayesian Modeling and Causal Inference · Neural Networks and Applications
MethodsPruning
