An Efficient Search Strategy for Aggregation and Discretization of Attributes of Bayesian Networks Using Minimum Description Length
Jem Corcoran, Daniel Tran, Nicholas Levine

TL;DR
This paper introduces an efficient search strategy for discretizing attributes in Bayesian networks, enabling the practical application of a minimum description length-based discretization method that preserves data information.
Contribution
It presents a novel, highly efficient search algorithm that makes the MDL-based discretization approach feasible for larger Bayesian networks.
Findings
The proposed method significantly reduces computational complexity.
It effectively preserves data information during discretization.
The approach is applicable to large-scale Bayesian network structure recovery.
Abstract
Bayesian networks are convenient graphical expressions for high dimensional probability distributions representing complex relationships between a large number of random variables. They have been employed extensively in areas such as bioinformatics, artificial intelligence, diagnosis, and risk management. The recovery of the structure of a network from data is of prime importance for the purposes of modeling, analysis, and prediction. Most recovery algorithms in the literature assume either discrete of continuous but Gaussian data. For general continuous data, discretization is usually employed but often destroys the very structure one is out to recover. Friedman and Goldszmidt suggest an approach based on the minimum description length principle that chooses a discretization which preserves the information in the original data set, however it is one which is difficult, if not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Gaussian Processes and Bayesian Inference · Bayesian Methods and Mixture Models
