Self-supervised Learning for Large-scale Item Recommendations

Tiansheng Yao; Xinyang Yi; Derek Zhiyuan Cheng; Felix Yu; Ting Chen,; Aditya Menon; Lichan Hong; Ed H. Chi; Steve Tjoa; Jieqi Kang; Evan Ettinger

arXiv:2007.12865·cs.LG·February 26, 2021·29 cites

Self-supervised Learning for Large-scale Item Recommendations

Tiansheng Yao, Xinyang Yi, Derek Zhiyuan Cheng, Felix Yu, Ting Chen,, Aditya Menon, Lichan Hong, Ed H. Chi, Steve Tjoa, Jieqi Kang, Evan Ettinger

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-supervised learning framework for large-scale item recommendation systems to address data sparsity, improve item representations, and enhance overall model performance, validated through real-world datasets and online experiments.

Contribution

It proposes a novel multi-task self-supervised learning framework with a feature correlation-based data augmentation method for large-scale recommender models, improving generalization and performance.

Findings

01

Superior performance over state-of-the-art regularization techniques

02

Effective in handling long-tail item feedback sparsity

03

Significant improvements in online business metrics

Abstract

Large scale recommender models find most relevant items from huge catalogs, and they play a critical role in modern search and recommendation systems. To model the input space with large-vocab categorical features, a typical recommender model learns a joint embedding space through neural networks for both queries and items from user feedback data. However, with millions to billions of items in the corpus, users tend to provide feedback for a very small set of them, causing a power-law distribution. This makes the feedback data for long-tail items extremely sparse. Inspired by the recent success in self-supervised representation learning research in both computer vision and natural language understanding, we propose a multi-task self-supervised learning (SSL) framework for large-scale item recommendations. The framework is designed to tackle the label sparsity problem by learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

massquantity/LibRecommender
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsDropout