Causal Discovery on Dependent Binary Data

Alex Chen; Qing Zhou

arXiv:2412.20289·cs.LG·December 31, 2024

Causal Discovery on Dependent Binary Data

Alex Chen, Qing Zhou

PDF

Open Access

TL;DR

This paper introduces a decorrelation-based method for causal discovery in dependent binary data, addressing the challenge of non-independent observations by estimating and removing dependence to improve causal graph learning accuracy.

Contribution

It proposes a novel pairwise maximum likelihood and EM-like algorithm to decorrelate dependent binary data for more accurate causal discovery.

Findings

01

Improved causal graph learning accuracy on synthetic data

02

Effective decorrelation of dependent binary data demonstrated

03

Method outperforms traditional approaches on real-world datasets

Abstract

The assumption of independence between observations (units) in a dataset is prevalent across various methodologies for learning causal graphical models. However, this assumption often finds itself in conflict with real-world data, posing challenges to accurate structure learning. We propose a decorrelation-based approach for causal graph learning on dependent binary data, where the local conditional distribution is defined by a latent utility model with dependent errors across units. We develop a pairwise maximum likelihood method to estimate the covariance matrix for the dependence among the units. Then, leveraging the estimated covariance matrix, we develop an EM-like iterative algorithm to generate and decorrelate samples of the latent utility variables, which serve as decorrelated data. Any standard causal discovery method can be applied on the decorrelated data to learn the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Bayesian Modeling and Causal Inference · Rough Sets and Fuzzy Logic