What can the millions of random treatments in nonexperimental data reveal about causes?
Andre F. Ribeiro, Frank Neffke, Ricardo Hausmann

TL;DR
This paper introduces a Bayesian method leveraging pairwise differences in nonexperimental data to estimate causal effects more accurately, expanding observational data into a quasi-experimental framework and recovering individual effects.
Contribution
It proposes a novel Bayesian approach that combines pairwise treatment differences to improve causal inference from large nonexperimental datasets.
Findings
Outperforms traditional estimators in simulations and real data.
Recovers individual causal effects in heterogeneous populations.
Effectively scales to large observational datasets.
Abstract
We propose a new method to estimate causal effects from nonexperimental data. Each pair of sample units is first associated with a stochastic 'treatment' - differences in factors between units - and an effect - a resultant outcome difference. It is then proposed that all such pairs can be combined to provide more accurate estimates of causal effects in observational data, provided a statistical model connecting combinatorial properties of treatments to the accuracy and unbiasedness of their effects. The article introduces one such model and a Bayesian approach to combine the pairwise observations typically available in nonexperimnetal data. This also leads to an interpretation of nonexperimental datasets as incomplete, or noisy, versions of ideal factorial experimental designs. This approach to causal effect estimation has several advantages: (1) it expands the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques
