Embedding Compression in Recommender Systems: A Survey

Shiwei Li; Huifeng Guo; Xing Tang; Ruiming Tang; Lu Hou; Ruixuan Li,; Rui Zhang

arXiv:2408.02304·cs.IR·August 7, 2024

Embedding Compression in Recommender Systems: A Survey

Shiwei Li, Huifeng Guo, Xing Tang, Ruiming Tang, Lu Hou, Ruixuan Li,, Rui Zhang

PDF

TL;DR

This survey reviews various embedding compression techniques in recommender systems, aiming to reduce memory usage and improve efficiency by categorizing approaches into low-precision, mixed-dimension, and weight-sharing methods.

Contribution

It provides a comprehensive classification and analysis of existing embedding compression methods in recommender systems, highlighting future research directions.

Findings

01

Embedding compression reduces memory costs in recommender systems.

02

Three main categories of compression techniques are identified.

03

Future prospects include developing more efficient and scalable methods.

Abstract

To alleviate the problem of information explosion, recommender systems are widely deployed to provide personalized information filtering services. Usually, embedding tables are employed in recommender systems to transform high-dimensional sparse one-hot vectors into dense real-valued embeddings. However, the embedding tables are huge and account for most of the parameters in industrial-scale recommender systems. In order to reduce memory costs and improve efficiency, various approaches are proposed to compress the embedding tables. In this survey, we provide a comprehensive review of embedding compression approaches in recommender systems. We first introduce deep learning recommendation models and the basic concept of embedding compression in recommender systems. Subsequently, we systematically organize existing approaches into three categories, namely low-precision, mixed-dimension,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.