How do some Bayesian Network machine learned graphs compare to causal   knowledge?

Anthony C. Constantinou; Norman Fenton; Martin Neil

arXiv:2101.10461·cs.AI·February 3, 2021

How do some Bayesian Network machine learned graphs compare to causal knowledge?

Anthony C. Constantinou, Norman Fenton, Martin Neil

PDF

Open Access

TL;DR

This study compares purely machine learned Bayesian Network graphs with knowledge-based ones, revealing that knowledge-based graphs are more accurate predictors in limited data scenarios, despite machine learning algorithms achieving higher model scores.

Contribution

The paper provides a systematic comparison between machine learned and knowledge-based Bayesian Networks, highlighting the importance of causal knowledge in limited data contexts and evaluating the effectiveness of various learning algorithms.

Findings

01

Knowledge-based graphs are more accurate predictors with limited data.

02

Machine learning algorithms achieve higher model scores but deviate from true graphs.

03

Simulated data results do not reliably predict real-world performance.

Abstract

The graph of a Bayesian Network (BN) can be machine learned, determined by causal knowledge, or a combination of both. In disciplines like bioinformatics, applying BN structure learning algorithms can reveal new insights that would otherwise remain unknown. However, these algorithms are less effective when the input data are limited in terms of sample size, which is often the case when working with real data. This paper focuses on purely machine learned and purely knowledge-based BNs and investigates their differences in terms of graphical structure and how well the implied statistical models explain the data. The tests are based on four previous case studies whose BN structure was determined by domain knowledge. Using various metrics, we compare the knowledge-based graphs to the machine learned graphs generated from various algorithms implemented in TETRAD spanning all three classes of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Data Quality and Management · Data Mining Algorithms and Applications