TL;DR
This paper provides theoretical guarantees for top-down decision tree algorithms in the agnostic setting, showing they can approximate the optimal tree error with provable bounds, unlike prior work focused on realizable cases.
Contribution
It establishes the first provable performance guarantees for top-down decision tree heuristics in the agnostic setting, including upper and near-matching lower bounds.
Findings
Decision trees of size s^{ ilde{O}(( ext{log} s)/ ext{ε}^2)} achieve error close to the optimal.
Prior guarantees did not exist for agnostic decision tree learning.
A near-matching lower bound of s^{ ilde{ ext{Omega}}( ext{log} s)} is shown.
Abstract
We give strengthened provable guarantees on the performance of widely employed and empirically successful {\sl top-down decision tree learning heuristics}. While prior works have focused on the realizable setting, we consider the more realistic and challenging {\sl agnostic} setting. We show that for all monotone functions~ and parameters , these heuristics construct a decision tree of size that achieves error , where denotes the error of the optimal size- decision tree for . Previously, such a guarantee was not known to be achievable by any algorithm, even one that is not based on top-down heuristics. We complement our algorithmic guarantee with a near-matching lower bound.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
