Approach of variable clustering and compression for learning large   Bayesian networks

Anna V. Bubnova

arXiv:2208.13605·stat.ML·August 30, 2022

Approach of variable clustering and compression for learning large Bayesian networks

Anna V. Bubnova

PDF

Open Access

TL;DR

This paper introduces a novel method for learning large Bayesian network structures by clustering features and using compressed information to enable faster, potentially parallelized structure learning with maintained accuracy.

Contribution

It presents a new approach combining feature space clustering with information compression to improve the efficiency of large Bayesian network structure learning.

Findings

01

Enhanced speed of structure learning.

02

Maintained accuracy with compressed data.

03

Applicable to parallel processing environments.

Abstract

This paper describes a new approach for learning structures of large Bayesian networks based on blocks resulting from feature space clustering. This clustering is obtained using normalized mutual information. And the subsequent aggregation of blocks is done using classical learning methods except that they are input with compressed information about combinations of feature values for each block. Validation of this approach is done for Hill-Climbing as a graph enumeration algorithm for two score functions: BIC and MI. In this way, potentially parallelizable block learning can be implemented even for those score functions that are considered unsuitable for parallelizable learning. The advantage of the approach is evaluated in terms of speed of work as well as the accuracy of the found structures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Data Mining Algorithms and Applications · Rough Sets and Fuzzy Logic

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings