Estimating the Causal Impact of Recommendation Systems from Observational Data
Amit Sharma, Jake M. Hofman, Duncan J. Watts

TL;DR
This paper introduces a method to estimate the causal impact of recommendation systems using observational data, leveraging instrumental variables, and applies it to Amazon data to quantify the true influence of recommendations.
Contribution
The paper presents a novel approach for causal inference of recommendation effects from observational data using instrumental variables based on traffic shocks.
Findings
At least 75% of recommendation-driven traffic would occur without recommendations.
The method successfully estimates causal effects from observational browsing logs.
Recommendations significantly influence user activity, but much activity would happen independently.
Abstract
Recommendation systems are an increasingly prominent part of the web, accounting for up to a third of all traffic on several of the world's most popular sites. Nevertheless, little is known about how much activity such systems actually cause over and above activity that would have occurred via other means (e.g., search) if recommendations were absent. Although the ideal way to estimate the causal impact of recommendations is via randomized experiments, such experiments are costly and may inconvenience users. In this paper, therefore, we present a method for estimating causal effects from purely observational data. Specifically, we show that causal identification through an instrumental variable is possible when a product experiences an instantaneous shock in direct traffic and the products recommended next to it do not. We then apply our method to browsing logs containing anonymized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
