Conditional Density Estimation with Histogram Trees
Lincen Yang, Matthijs van Leeuwen

TL;DR
This paper introduces Conditional Density Trees (CDTrees), a non-parametric, interpretable tree-based model for conditional density estimation that outperforms existing methods in accuracy and robustness.
Contribution
The paper proposes CDTrees, a novel tree-based approach for CDE that uses histogram leaves and MDL for hyperparameter-free learning, enhancing interpretability and performance.
Findings
CDTrees outperform existing interpretable CDE methods in log-loss.
CDTrees are more robust against irrelevant features.
CDTrees produce smaller, more interpretable trees.
Abstract
Conditional density estimation (CDE) goes beyond regression by modeling the full conditional distribution, providing a richer understanding of the data than just the conditional mean in regression. This makes CDE particularly useful in critical application domains. However, interpretable CDE methods are understudied. Current methods typically employ kernel-based approaches, using kernel functions directly for kernel density estimation or as basis functions in linear models. In contrast, despite their conceptual simplicity and visualization suitability, tree-based methods -- which are arguably more comprehensible -- have been largely overlooked for CDE tasks. Thus, we propose the Conditional Density Tree (CDTree), a fully non-parametric model consisting of a decision tree in which each leaf is formed by a histogram model. Specifically, we formalize the problem of learning a CDTree using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Management and Algorithms · Bayesian Methods and Mixture Models · Anomaly Detection Techniques and Applications
Methods+ ( 1 ) ⟷ 888 ⟷ ( 829 ) ⟷ 0881||How do I resolve a dispute on Expedia?
