Adversarial Counterfactual Learning and Evaluation for Recommender System
Da Xu, Chuanwei Ruan, Evren Korpeoglu, Sushant Kumar, Kannan Achan

TL;DR
This paper introduces an adversarial counterfactual learning framework for recommender systems that accounts for exposure bias, providing theoretical insights, a minimax formulation, and extensive simulations to improve recommendation accuracy.
Contribution
It proposes a novel minimax empirical risk formulation and an adversarial training approach to address exposure bias in recommender systems, with theoretical guarantees.
Findings
The approach improves recommendation accuracy under exposure bias.
Theoretical bounds support the effectiveness of the method.
Simulation results demonstrate benefits across various settings.
Abstract
The feedback data of recommender systems are often subject to what was exposed to the users; however, most learning and evaluation methods do not account for the underlying exposure mechanism. We first show in theory that applying supervised learning to detect user preferences may end up with inconsistent results in the absence of exposure information. The counterfactual propensity-weighting approach from causal inference can account for the exposure mechanism; nevertheless, the partial-observation nature of the feedback data can cause identifiability issues. We propose a principled solution by introducing a minimax empirical risk formulation. We show that the relaxation of the dual problem can be converted to an adversarial game between two recommendation models, where the opponent of the candidate model characterizes the underlying exposure mechanism. We provide learning bounds and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research
MethodsCausal inference
