Cross-Dataset Propensity Estimation for Debiasing Recommender Systems

Fengyu Li; Sarah Dean

arXiv:2212.13892·cs.IR·December 29, 2022

Cross-Dataset Propensity Estimation for Debiasing Recommender Systems

Fengyu Li, Sarah Dean

PDF

Open Access

TL;DR

This paper proposes a method to reduce distribution shift in recommender system datasets caused by selection bias by leveraging two differently quantized datasets and applying inverse probability scoring, leading to improved performance.

Contribution

It introduces a novel approach using two datasets with different quantizations and inverse probability scoring to mitigate selection bias in recommender systems.

Findings

01

Significant performance improvements over single-dataset methods

02

Effective reduction of distribution shift caused by selection bias

03

Demonstrated robustness across different dataset quantizations

Abstract

Datasets for training recommender systems are often subject to distribution shift induced by users' and recommenders' selection biases. In this paper, we study the impact of selection bias on datasets with different quantization. We then leverage two differently quantized datasets from different source distributions to mitigate distribution shift by applying the inverse probability scoring method from causal inference. Empirically, our approach gains significant performance improvement over single-dataset methods and alternative ways of combining two datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Recommender Systems and Techniques · Machine Learning and Data Classification