KuaiRec: A Fully-observed Dataset and Insights for Evaluating   Recommender Systems

Chongming Gao; Shijun Li; Wenqiang Lei; Jiawei Chen; Biao Li; Peng; Jiang; Xiangnan He; Jiaxin Mao; Tat-Seng Chua

arXiv:2202.10842·cs.IR·August 19, 2022

KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems

Chongming Gao, Shijun Li, Wenqiang Lei, Jiawei Chen, Biao Li, Peng, Jiang, Xiangnan He, Jiaxin Mao, Tat-Seng Chua

PDF

3 Repos

TL;DR

This paper introduces KuaiRec, a fully-observed user-item interaction dataset from Kuaishou, enabling more accurate offline evaluation of recommender systems and revealing how data density and exposure bias influence evaluation outcomes.

Contribution

It provides the first real-world fully-observed dataset for recommender systems, facilitating better understanding of evaluation biases and their impact on method performance.

Findings

01

Method rankings vary with data density and exposure bias.

02

Estimating missing interactions can mitigate evaluation discrepancies.

03

Fully-observed data is crucial for reliable recommender system evaluation.

Abstract

The progress of recommender systems is hampered mainly by evaluation as it requires real-time interactions between humans and systems, which is too laborious and expensive. This issue is usually approached by utilizing the interaction history to conduct offline evaluation. However, existing datasets of user-item interactions are partially observed, leaving it unclear how and to what extent the missing interactions will influence the evaluation. To answer this question, we collect a fully-observed dataset from Kuaishou's online environment, where almost all 1,411 users have been exposed to all 3,327 items. To the best of our knowledge, this is the first real-world fully-observed data with millions of user-item interactions. With this unique dataset, we conduct a preliminary analysis of how the two factors - data density and exposure bias - affect the evaluation results of multi-round…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.