Harnessing the Power of Choices in Decision Tree Learning
Guy Blanc, Jane Lange, Chirag Pabbaraju, Colin Sullivan, Li-Yang Tan,, Mo Tiwari

TL;DR
This paper introduces Top-k, a decision tree learning algorithm that considers multiple top attributes at each split, leading to improved accuracy and scalability over traditional greedy and optimal methods.
Contribution
The paper presents Top-k, a simple yet powerful generalization of decision tree algorithms, with theoretical guarantees and empirical evidence of its superiority.
Findings
Top-k can significantly outperform single-attribute greedy algorithms.
Top-k achieves accuracy close to optimal decision trees in experiments.
Top-k scales better than optimal algorithms, handling larger datasets.
Abstract
We propose a simple generalization of standard and empirically successful decision tree learning algorithms such as ID3, C4.5, and CART. These algorithms, which have been central to machine learning for decades, are greedy in nature: they grow a decision tree by iteratively splitting on the best attribute. Our algorithm, Top-, considers the best attributes as possible splits instead of just the single best attribute. We demonstrate, theoretically and empirically, the power of this simple generalization. We first prove a {\sl greediness hierarchy theorem} showing that for every , Top- can be dramatically more powerful than Top-: there are data distributions for which the former achieves accuracy , whereas the latter only achieves accuracy . We then show, through extensive experiments, that Top- outperforms the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsImbalanced Data Classification Techniques · Data Mining Algorithms and Applications · Machine Learning and Data Classification
