Towards Reliable Item Sampling for Recommendation Evaluation

Dong Li; Ruoming Jin; Zhenming Liu; Bin Ren; Jing Gao; Zhi Liu

arXiv:2211.15743·cs.IR·October 12, 2023

Towards Reliable Item Sampling for Recommendation Evaluation

Dong Li, Ruoming Jin, Zhenming Liu, Bin Ren, Jing Gao, Zhi Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces new sampling estimators and adaptive methods to improve the accuracy and reliability of recommendation evaluation metrics, addressing theoretical gaps and the 'blind spot' issue in item sampling.

Contribution

It proposes a novel item-sampling estimator with theoretical error optimization and an adaptive sampling approach to mitigate the 'blind spot' problem, enhancing evaluation reliability.

Findings

01

The new estimator outperforms previous methods in accuracy.

02

The adaptive sampling method effectively addresses the 'blind spot' issue.

03

Experimental results validate the theoretical analysis and improvements.

Abstract

Since Rendle and Krichene argued that commonly used sampling-based evaluation metrics are "inconsistent" with respect to the global metrics (even in expectation), there have been a few studies on the sampling-based recommender system evaluation. Existing methods try either mapping the sampling-based metrics to their global counterparts or more generally, learning the empirical rank distribution to estimate the top- $K$ metrics. However, despite existing efforts, there is still a lack of rigorous theoretical understanding of the proposed metric estimators, and the basic item sampling also suffers from the "blind spot" issue, i.e., estimation accuracy to recover the top- $K$ metrics when $K$ is small can still be rather substantial. In this paper, we provide an in-depth investigation into these problems and make two innovative contributions. First, we propose a new item-sampling estimator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards Reliable Item Sampling for Recommendation Evaluation· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Stochastic Gradient Optimization Techniques