Superconstant Inapproximability of Decision Tree Learning

Caleb Koch; Carmen Strassle; Li-Yang Tan

arXiv:2407.01402·cs.CC·July 2, 2024

Superconstant Inapproximability of Decision Tree Learning

Caleb Koch, Carmen Strassle, Li-Yang Tan

PDF

Open Access

TL;DR

This paper proves that properly PAC learning decision trees remains NP-hard even when the hypothesis tree is only required to be within a constant factor of the optimal size, establishing superconstant inapproximability.

Contribution

It demonstrates NP-hardness of near-optimal decision tree learning, introduces a new simpler proof of existing hardness, and develops a novel XOR lemma with sharp parameters.

Findings

01

NP-hardness persists within any constant factor of optimal decision trees

02

Introduces a new, simpler proof of decision tree learning hardness

03

Develops a new XOR lemma with sharp parameters for decision trees

Abstract

We consider the task of properly PAC learning decision trees with queries. Recent work of Koch, Strassle, and Tan showed that the strictest version of this task, where the hypothesis tree $T$ is required to be optimally small, is NP-hard. Their work leaves open the question of whether the task remains intractable if $T$ is only required to be close to optimal, say within a factor of 2, rather than exactly optimal. We answer this affirmatively and show that the task indeed remains NP-hard even if $T$ is allowed to be within any constant factor of optimal. More generally, our result allows for a smooth tradeoff between the hardness assumption and the inapproximability factor. As Koch et al.'s techniques do not appear to be amenable to such a strengthening, we first recover their result with a new and simpler proof, which we couple with a new XOR lemma for decision trees. While there is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Rough Sets and Fuzzy Logic · Fuzzy Logic and Control Systems