Causal Inference Struggles with Agency on Online Platforms
Smitha Milli, Luca Belli, Moritz Hardt

TL;DR
This study evaluates the reliability of observational studies based on user self-selection on online platforms, finding they poorly replicate randomized experiment results and often produce opposite effects, questioning their validity.
Contribution
It provides large-scale empirical evidence that observational studies from user self-selection are unreliable for causal inference on online platforms, contrasting with randomized experiments.
Findings
Observational estimates poorly replicate randomized results.
Most observational estimates have opposite signs to randomized effects.
User self-selection leads to unreliable causal inferences.
Abstract
Online platforms regularly conduct randomized experiments to understand how changes to the platform causally affect various outcomes of interest. However, experimentation on online platforms has been criticized for having, among other issues, a lack of meaningful oversight and user consent. As platforms give users greater agency, it becomes possible to conduct observational studies in which users self-select into the treatment of interest as an alternative to experiments in which the platform controls whether the user receives treatment or not. In this paper, we conduct four large-scale within-study comparisons on Twitter aimed at assessing the effectiveness of observational studies derived from user self-selection on online platforms. In a within-study comparison, treatment effects from an observational study are assessed based on how effectively they replicate results from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
