TL;DR
This paper introduces oblique predictive clustering trees that efficiently handle high-dimensional and sparse data, achieving comparable predictive performance to state-of-the-art methods while significantly reducing training time.
Contribution
The paper proposes novel methods for learning oblique splits in PCTs, improving scalability and efficiency in high-dimensional, sparse data scenarios.
Findings
Achieve performance comparable to state-of-the-art methods.
Order of magnitude faster training times than standard PCTs.
Effective extraction of feature importance scores.
Abstract
Predictive clustering trees (PCTs) are a well established generalization of standard decision trees, which can be used to solve a variety of predictive modeling tasks, including structured output prediction. Combining them into ensembles yields state-of-the-art performance. Furthermore, the ensembles of PCTs can be interpreted by calculating feature importance scores from the learned models. However, their learning time scales poorly with the dimensionality of the output space. This is often problematic, especially in (hierarchical) multi-label classification, where the output can consist of hundreds of potential labels. Also, learning of PCTs can not exploit the sparsity of data to improve the computational efficiency, which is common in both input (molecular fingerprints, bag of words representations) and output spaces (in multi-label classification, examples are often labeled with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
