Reconsidering Learning Objectives in Unbiased Recommendation with Unobserved Confounders
Teng Xiao, Zhengyu Chen, Suhang Wang

TL;DR
This paper provides a theoretical framework linking unbiased recommendation to distribution shift, analyzes existing methods, and introduces a new adversarial self-training approach that improves recommendation fairness and accuracy.
Contribution
It offers the first theoretical analysis connecting unbiased recommendation to distribution shift and proposes a novel adversarial self-training framework for better unbiased learning.
Findings
Existing unbiased methods implicitly align biased and unbiased distributions.
Theoretical bounds explain the effectiveness of current unbiased algorithms.
AST outperforms baselines on real-world and semi-synthetic datasets.
Abstract
This work studies the problem of learning unbiased algorithms from biased feedback for recommendation. We address this problem from a novel distribution shift perspective. Recent works in unbiased recommendation have advanced the state-of-the-art with various techniques such as re-weighting, multi-task learning, and meta-learning. Despite their empirical successes, most of them lack theoretical guarantees, forming non-negligible gaps between theories and recent algorithms. In this paper, we propose a theoretical understanding of why existing unbiased learning objectives work for unbiased recommendation. We establish a close connection between unbiased recommendation and distribution shift, which shows that existing unbiased learning objectives implicitly align biased training and unbiased test distributions. Built upon this connection, we develop two generalization bounds for existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Recommender Systems and Techniques
