Dis-entangling Mixture of Interventions on a Causal Bayesian Network Using Aggregate Observations
Gaurav Sinha, Ayush Chauhan, Aurghya Maiti, Naman Poddar, Pulkit Goel

TL;DR
This paper addresses the challenge of identifying mixing proportions of interventions in a causal Bayesian network using aggregate observational data, providing theoretical insights and practical algorithms for exact and approximate recovery.
Contribution
It offers a constructive proof of identifiability under certain conditions and develops an efficient algorithm for recovering mixing proportions from aggregate data.
Findings
Exact recovery is possible when marginals are known and certain distributions are excluded.
An optimization framework effectively estimates mixing proportions when marginals are approximate.
Experimental validation on ALARM and e-commerce data demonstrates the method's practical utility.
Abstract
We study the problem of separating a mixture of distributions, all of which come from interventions on a known causal bayesian network. Given oracle access to marginals of all distributions resulting from interventions on the network, and estimates of marginals from the mixture distribution, we want to recover the mixing proportions of different mixture components. We show that in the worst case, mixing proportions cannot be identified using marginals only. If exact marginals of the mixture distribution were known, under a simple assumption of excluding a few distributions from the mixture, we show that the mixing proportions become identifiable. Our identifiability proof is constructive and gives an efficient algorithm recovering the mixing proportions exactly. When exact marginals are not available, we design an optimization framework to estimate the mixing proportions. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Quality and Management
