Oblique Predictive Clustering Trees

Toma\v{z} Stepi\v{s}nik; Dragi Kocev

arXiv:2007.13617·cs.LG·November 6, 2020

Oblique Predictive Clustering Trees

Toma\v{z} Stepi\v{s}nik, Dragi Kocev

PDF

1 Repo

TL;DR

This paper introduces oblique predictive clustering trees that efficiently handle high-dimensional and sparse data, achieving comparable predictive performance to state-of-the-art methods while significantly reducing training time.

Contribution

The paper proposes novel methods for learning oblique splits in PCTs, improving scalability and efficiency in high-dimensional, sparse data scenarios.

Findings

01

Achieve performance comparable to state-of-the-art methods.

02

Order of magnitude faster training times than standard PCTs.

03

Effective extraction of feature importance scores.

Abstract

Predictive clustering trees (PCTs) are a well established generalization of standard decision trees, which can be used to solve a variety of predictive modeling tasks, including structured output prediction. Combining them into ensembles yields state-of-the-art performance. Furthermore, the ensembles of PCTs can be interpreted by calculating feature importance scores from the learned models. However, their learning time scales poorly with the dimensionality of the output space. This is often problematic, especially in (hierarchical) multi-label classification, where the output can consist of hundreds of potential labels. Also, learning of PCTs can not exploit the sparsity of data to improve the computational efficiency, which is common in both input (molecular fingerprints, bag of words representations) and output spaces (in multi-label classification, examples are often labeled with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://gitlab.com/TStepi/spyct
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.