TL;DR
This paper presents a new parameter estimation method for Bayesian network classifiers using hierarchical Dirichlet processes, improving their accuracy and competitiveness with Random Forests on categorical datasets.
Contribution
Introduces hierarchical Dirichlet processes for more accurate parameter estimation in Bayesian network classifiers, enhancing their predictive performance.
Findings
BNCs with HDP outperform traditional methods on categorical datasets.
The proposed method maintains out-of-core learning capabilities.
Classifiers achieve competitive accuracy with Random Forests.
Abstract
This paper introduces a novel parameter estimation method for the probability tables of Bayesian network classifiers (BNCs), using hierarchical Dirichlet processes (HDPs). The main result of this paper is to show that improved parameter estimation allows BNCs to outperform leading learning methods such as Random Forest for both 0-1 loss and RMSE, albeit just on categorical datasets. As data assets become larger, entering the hyped world of "big", efficient accurate classification requires three main elements: (1) classifiers with low-bias that can capture the fine-detail of large datasets (2) out-of-core learners that can learn from data without having to hold it all in main memory and (3) models that can classify new data very efficiently. The latest Bayesian network classifiers (BNCs) satisfy these requirements. Their bias can be controlled easily by increasing the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
