TL;DR
This paper introduces the split-door criterion, a novel method for identifying causal effects in time series data by leveraging auxiliary outcomes that are independent of the cause, enabling more accurate causal inference.
Contribution
The paper presents a new approach that reduces causal identification to independence testing in split outcomes, applicable to large-scale, real-world datasets like Amazon's recommender system.
Findings
The method successfully identified causal impacts on Amazon traffic data.
Estimated that 50-80% of traffic attributed to recommendations could occur without them.
The approach generalizes beyond natural experiments to broader data settings.
Abstract
We present a method for estimating causal effects in time series data when fine-grained information about the outcome of interest is available. Specifically, we examine what we call the split-door setting, where the outcome variable can be split into two parts: one that is potentially affected by the cause being studied and another that is independent of it, with both parts sharing the same (unobserved) confounders. We show that under these conditions, the problem of identification reduces to that of testing for independence among observed variables, and present a method that uses this approach to automatically find subsets of the data that are causally identified. We demonstrate the method by estimating the causal impact of Amazon's recommender system on traffic to product pages, finding thousands of examples within the dataset that satisfy the split-door criterion. Unlike past studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
