Multi-Armed Bandits-Based Optimization of Decision Trees
Hasibul Karim Shanto, Umme Ayman Koana, Shadikur Rahman

TL;DR
This paper introduces a novel reinforcement learning approach using Multi-Armed Bandits to dynamically prune decision trees, aiming to improve their generalization and predictive performance over traditional greedy pruning methods.
Contribution
It proposes a MAB-based pruning method that treats pruning as an exploration-exploitation problem, enhancing decision tree optimization beyond conventional techniques.
Findings
MAB-based pruning outperforms traditional methods on benchmark datasets.
The approach improves the generalization ability of decision trees.
Experimental results show better predictive accuracy with the proposed method.
Abstract
Decision trees, without appropriate constraints, can easily become overly complex and prone to overfit, capturing noise rather than generalizable patterns. To resolve this problem,pruning operation is a crucial part in optimizing decision trees, as it not only reduces the complexity of trees but also decreases the probability of generating overfit models. The conventional pruning techniques like Cost-Complexity Pruning (CCP) and Reduced Error Pruning (REP) are mostly based on greedy approaches that focus on immediate gains in performance while pruning nodes of the decision tree. However, this might result in a lower generalization in the long run, compromising the robust ability of the tree model when introduced to unseen data samples, particularly when trained with small and complex datasets. To address this challenge, we are proposing a Multi-Armed Bandits (MAB)-based pruning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Data Stream Mining Techniques · Machine Learning and Data Classification
