Investigating the effect of binning on causal discovery

Andrew Colt Deckert; Erich Kummerfeld

arXiv:2202.11789·cs.LG·February 25, 2022

TL;DR

This simulation study examines how binning continuous data affects the Greedy Equivalence Search (GES) causal discovery algorithm. Unbinned data often yield the best search performance, with some exceptions, while binned data are more sensitive to sample size and tuning parameters.

Contribution

It reports a simulation study of how binning affects the GES causal discovery algorithm, an effect the authors state had not been directly investigated before, and identifies conditions under which binning changes search performance.

Findings

01. Unbinned data often yield higher search performance.

02. Binned data are more sensitive to sample size and tuning parameters.

03. Interactive effects exist between sample size, binning, and tuning parameters.

Abstract

Binning (a.k.a. discretization) of numerically continuous measurements is a widespread but controversial practice in data collection, analysis, and presentation. The consequences of binning have been evaluated for many different kinds of data analysis methods; however, so far the effect of binning on causal discovery algorithms has not been directly investigated. This paper reports the results of a simulation study that examined the effect of binning on the Greedy Equivalence Search (GES) causal discovery algorithm. Our findings suggest that unbinned continuous data often result in the highest search performance, but some exceptions are identified. We also found that binned data are more sensitive to changes in sample size and tuning parameters, and identified some interactive effects between sample size, binning, and tuning parameters on performance.
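
To make the term "binning" concrete, here is a minimal Python sketch of what discretizing simulated continuous data can look like. It is not the paper's procedure: the two-variable linear Gaussian model, the equal-width binning scheme, the bin counts, and the helper names (`simulate`, `equal_width_bins`, `pearson`) are all assumptions chosen for illustration. It does not run GES. It only compares a simple correlation before and after binning, as a rough proxy for how much dependence information survives discretization.

```python
import bisect
import random


def simulate(n, coef=0.8, seed=0):
    """Draw n samples from a hypothetical model X -> Y with Gaussian noise."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [coef * xi + rng.gauss(0, 1) for xi in x]
    return x, y


def equal_width_bins(values, n_bins):
    """Map each value to a bin index in [0, n_bins - 1] using equal-width cuts.

    Equal-width binning is an assumption here; the abstract does not say
    which binning scheme the study used.
    """
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    # Interior cut points only; bisect finds the bin each value falls into.
    cuts = [lo + width * i for i in range(1, n_bins)]
    return [bisect.bisect_right(cuts, v) for v in values]


def pearson(xs, ys):
    """Plain Pearson correlation, written out to avoid extra dependencies."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(xs, ys))
    vx = sum((a - mx) ** 2 for a in xs)
    vy = sum((b - my) ** 2 for b in ys)
    return cov / (vx * vy) ** 0.5


x, y = simulate(2000)
r_continuous = pearson(x, y)

# Correlation between the binned versions of X and Y for several bin counts.
r_binned = {
    k: pearson(equal_width_bins(x, k), equal_width_bins(y, k))
    for k in (2, 3, 5, 10)
}

print(f"continuous: r = {r_continuous:.3f}")
for k, r in r_binned.items():
    print(f"{k:>2} bins:    r = {r:.3f}")
```

Varying `n` and the bin count in this sketch is one simple way to explore how sample size and binning granularity might interact, which is the kind of interaction the abstract reports for GES. Conclusions about GES itself would require running the actual search on the authors' simulation setup.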

Code & Models

Repositories

cdecker8/Investigating-the-effect-of-binning-on-causal-discovery-online-supplemental-information
none · Official
