Extremely Fast Decision Tree
Chaitanya Manapragada, Geoff Webb, Mahsa Salehi

TL;DR
The paper introduces Hoeffding Anytime Tree, an incremental decision tree algorithm that is more statistically efficient and achieves higher accuracy than existing methods like Hoeffding Tree, especially on large datasets, with minimal additional computational cost.
Contribution
It presents Hoeffding Anytime Tree, a novel incremental decision tree algorithm that outperforms Hoeffding Tree in accuracy and efficiency, and demonstrates its practical implementation and benefits.
Findings
Significantly higher prequential accuracy on large datasets
Produces the asymptotic batch tree in the limit
Resilient to concept drift
Abstract
We introduce a novel incremental decision tree learning algorithm, Hoeffding Anytime Tree, that is statistically more efficient than the current state-of-the-art, Hoeffding Tree. We demonstrate that an implementation of Hoeffding Anytime Tree---"Extremely Fast Decision Tree", a minor modification to the MOA implementation of Hoeffding Tree---obtains significantly superior prequential accuracy on most of the largest classification datasets from the UCI repository. Hoeffding Anytime Tree produces the asymptotic batch tree in the limit, is naturally resilient to concept drift, and can be used as a higher accuracy replacement for Hoeffding Tree in most scenarios, at a small additional computational cost.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
