Learning Small Decision Trees with Few Outliers: A Parameterized Perspective
Harmender Gahlawat, Meirav Zehavi

TL;DR
This paper investigates the parameterized complexity of constructing small decision trees with few outliers, establishing hardness results and fixed-parameter tractability under certain parameters.
Contribution
It introduces a generalized decision tree learning problem considering outliers and provides complexity classifications, including W[1]-hardness and fixed-parameter tractability results.
Findings
DTSO and DTDO are W[1]-hard with respect to size and depth parameters plus feature differences.
Both problems are fixed-parameter tractable when parameterized by the number of outliers t.
The paper presents kernelization complexity results, showing both positive and negative outcomes.
Abstract
Decision trees are a fundamental tool in machine learning for representing, classifying, and generalizing data. It is desirable to construct ``small'' decision trees, by minimizing either the \textit{size} () or the \textit{depth} of the \textit{decision tree} (\textsc{DT}). Recently, the parameterized complexity of \textsc{Decision Tree Learning} has attracted a lot of attention. We consider a generalization of \textsc{Decision Tree Learning} where given a \textit{classification instance} and an integer , the task is to find a ``small'' \textsc{DT} that disagrees with in at most examples. We consider two problems: \textsc{DTSO} and \textsc{DTDO}, where the goal is to construct a \textsc{DT} minimizing and , respectively. We first establish that both \textsc{DTSO} and \textsc{DTDO} are W[1]-hard when parameterized by and ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFuzzy Logic and Control Systems · Data Mining Algorithms and Applications · Statistical and Computational Modeling
