Rethinking Collaborative Metric Learning: Toward an Efficient   Alternative without Negative Sampling

Shilong Bao; Qianqian Xu; Zhiyong Yang; Xiaochun Cao; Qingming Huang

arXiv:2206.11549·cs.LG·June 24, 2022

Rethinking Collaborative Metric Learning: Toward an Efficient Alternative without Negative Sampling

Shilong Bao, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang

PDF

1 Repo

TL;DR

This paper analyzes the bias introduced by negative sampling in Collaborative Metric Learning (CML) for recommendation systems and proposes a sampling-free alternative, SFCML, demonstrating improved performance without sampling bias.

Contribution

The paper provides a theoretical analysis of negative sampling bias in CML and introduces a novel sampling-free method, SFCML, to improve generalization in recommendation systems.

Findings

01

Negative sampling introduces bias in CML's generalization error.

02

Sampling-free CML (SFCML) eliminates bias caused by negative sampling.

03

SFCML outperforms traditional CML on seven benchmark datasets.

Abstract

The recently proposed Collaborative Metric Learning (CML) paradigm has aroused wide interest in the area of recommendation systems (RS) owing to its simplicity and effectiveness. Typically, the existing literature of CML depends largely on the \textit{negative sampling} strategy to alleviate the time-consuming burden of pairwise computation. However, in this work, by taking a theoretical analysis, we find that negative sampling would lead to a biased estimation of the generalization error. Specifically, we show that the sampling-based CML would introduce a bias term in the generalization bound, which is quantified by the per-user \textit{Total Variance} (TV) between the distribution induced by negative sampling and the ground truth distribution. This suggests that optimizing the sampling-based CML loss function does not ensure a small generalization error even with sufficiently large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

statusrank/LibCML
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.