Learning Bounded Treewidth Bayesian Networks with Thousands of Variables

Mauro Scanagatta; Giorgio Corani; Cassio P. de Campos; Marco Zaffalon

arXiv:1605.03392·cs.AI·May 12, 2016

Open Access

TL;DR

This paper introduces a scalable algorithm for learning Bayesian networks with bounded treewidth from large datasets, significantly improving efficiency and performance over existing methods.

Contribution

A novel algorithm capable of learning large-scale bounded treewidth Bayesian networks, outperforming current state-of-the-art approaches on datasets with up to ten thousand variables.

Findings

01 Outperforms existing methods on large datasets

02 Scales effectively to thousands of variables

03 Maintains low inference complexity

Abstract

We present a method for learning treewidth-bounded Bayesian networks from data sets containing thousands of variables. Bounding the treewidth of a Bayesian network greatly reduces the complexity of inference. Yet, because treewidth is a global property of the graph, bounding it considerably increases the difficulty of the learning process. We propose a novel algorithm for this task that scales to large domains and large treewidths. Our approach consistently outperforms the state of the art on data sets with up to ten thousand variables.
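To make the notion of treewidth concrete, the following is a minimal sketch, not the paper's learning algorithm. It moralizes a DAG (connects each node to its parents and the parents to one another) and then computes an upper bound on the treewidth of the resulting undirected graph with a greedy min-degree elimination order. The function names `moralize` and `treewidth_upper_bound` and the `{node: [parents]}` input format are assumptions made for this illustration. Exact inference by junction tree is exponential in the treewidth, which is why a bound on it is valuable; the sketch also shows why treewidth is a global property, since eliminating one node can add edges between distant parts of the graph.

```python
from itertools import combinations


def moralize(parents):
    """Build the moral graph of a DAG given as {node: [parents]}.

    Returns undirected adjacency sets: each node is linked to its parents,
    and all parents of a common child are linked to each other.
    """
    adj = {v: set() for v in parents}
    for ps in parents.values():
        for p in ps:
            adj.setdefault(p, set())  # tolerate parents missing as keys
    for child, ps in parents.items():
        for p in ps:
            adj[child].add(p)
            adj[p].add(child)
        for p, q in combinations(ps, 2):  # "marry" co-parents
            adj[p].add(q)
            adj[q].add(p)
    return adj


def treewidth_upper_bound(adj):
    """Greedy min-degree elimination; returns an upper bound on treewidth.

    This is a heuristic (an assumption of this sketch): the true treewidth
    can be smaller than the returned value on some graphs.
    """
    adj = {v: set(ns) for v, ns in adj.items()}  # work on a copy
    width = 0
    while adj:
        # Pick the node with the fewest neighbours (ties broken by name).
        v = min(adj, key=lambda u: (len(adj[u]), str(u)))
        nbrs = adj.pop(v)
        width = max(width, len(nbrs))
        for a, b in combinations(nbrs, 2):  # fill-in edges
            adj[a].add(b)
            adj[b].add(a)
        for n in nbrs:
            adj[n].discard(v)
    return width


# A chain A -> B -> C has a path as its moral graph.
chain = {"A": [], "B": ["A"], "C": ["B"]}
# A node with three parents yields a 4-clique after moralization.
collider = {"A": [], "B": [], "C": [], "D": ["A", "B", "C"]}

print(treewidth_upper_bound(moralize(chain)))
print(treewidth_upper_bound(moralize(collider)))
```

In this sketch a single child with k parents already forces a bound of k, so limiting the number of parents per node is necessary but not sufficient to limit the treewidth of the whole network.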

Taxonomy

Topics: Bayesian Modeling and Causal Inference · Data Quality and Management · Data Management and Algorithms