Revisiting Alternative Experimental Settings for Evaluating Top-N Item   Recommendation Algorithms

Wayne Xin Zhao; Junhua Chen; Pengfei Wang; Qi Gu; Ji-Rong Wen

arXiv:2010.04484·cs.IR·October 12, 2020

Revisiting Alternative Experimental Settings for Evaluating Top-N Item Recommendation Algorithms

Wayne Xin Zhao, Junhua Chen, Pengfei Wang, Qi Gu, Ji-Rong Wen

PDF

Open Access

TL;DR

This paper critically examines and compares different experimental setups for evaluating top-N item recommendation algorithms, providing guidelines to improve assessment accuracy.

Contribution

It systematically analyzes dataset splitting, sampled metrics, and domain selection, offering practical recommendations for more reliable evaluation of recommendation algorithms.

Findings

01

Different experimental settings significantly affect evaluation outcomes.

02

Recommendations for dataset splitting and metric sampling improve assessment reliability.

03

Guidelines help standardize evaluation practices in top-N recommendation research.

Abstract

Top-N item recommendation has been a widely studied task from implicit feedback. Although much progress has been made with neural methods, there is increasing concern on appropriate evaluation of recommendation algorithms. In this paper, we revisit alternative experimental settings for evaluating top-N recommendation algorithms, considering three important factors, namely dataset splitting, sampled metrics and domain selection. We select eight representative recommendation algorithms (covering both traditional and neural methods) and construct extensive experiments on a very large dataset. By carefully revisiting different options, we make several important findings on the three factors, which directly provide useful suggestions on how to appropriately set up the experiments for top-N item recommendation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Advanced Graph Neural Networks