Scalable Cross-Entropy Loss for Sequential Recommendations with Large   Item Catalogs

Gleb Mezentsev; Danil Gusak; Ivan Oseledets; Evgeny Frolov

arXiv:2409.18721·cs.IR·December 3, 2024

Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs

Gleb Mezentsev, Danil Gusak, Ivan Oseledets, Evgeny Frolov

PDF

1 Repo

TL;DR

This paper proposes a scalable Cross-Entropy loss function for sequential recommendation systems with large item catalogs, significantly reducing memory usage while maintaining or improving recommendation quality.

Contribution

It introduces a novel SCE loss that approximates traditional CE efficiently, enabling large-scale recommendations without high GPU memory consumption.

Findings

01

Reduces peak memory usage by up to 100 times

02

Maintains or exceeds recommendation quality metrics

03

Effective for large-scale datasets and models

Abstract

Scalability issue plays a crucial role in productionizing modern recommender systems. Even lightweight architectures may suffer from high computational overload due to intermediate calculations, limiting their practicality in real-world applications. Specifically, applying full Cross-Entropy (CE) loss often yields state-of-the-art performance in terms of recommendations quality. Still, it suffers from excessive GPU memory utilization when dealing with large item catalogs. This paper introduces a novel Scalable Cross-Entropy (SCE) loss function in the sequential learning setup. It approximates the CE loss for datasets with large-size catalogs, enhancing both time efficiency and memory usage without compromising recommendations quality. Unlike traditional negative sampling methods, our approach utilizes a selective GPU-efficient computation strategy, focusing on the most informative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AIRI-Institute/Scalable-SASRec
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax