OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction

Fuyuan Lyu, Xing Tang, Hong Zhu, Huifeng Guo, Yingxue Zhang, Ruiming Tang, Xue Liu

arXiv:2208.04482 · cs.IR · September 7, 2022


TL;DR

OptEmbed introduces a unified framework for learning optimal, compact embedding tables in CTR prediction models by pruning redundant features and efficiently searching for the best embedding dimensions, leading to improved performance.

Contribution

The paper proposes OptEmbed, a novel method that jointly prunes and searches for optimal embedding dimensions using a supernet and evolution search, surpassing existing methods in efficiency and effectiveness.

Findings

01 OptEmbed produces more compact embedding tables.

02 It improves CTR prediction performance.

03 It reduces memory usage significantly.

Abstract

Learning the embedding table plays a fundamental role in click-through rate (CTR) prediction, in terms of both model performance and memory usage. The embedding table is a two-dimensional tensor whose axes indicate the number of feature values and the embedding dimension, respectively. To learn an efficient and effective embedding table, recent works assign different embedding dimensions to feature fields, reduce the number of embeddings, or mask the embedding table parameters. However, none of these existing approaches obtains an optimal embedding table. On the one hand, varying the embedding dimensions still requires a large amount of memory because of the vast number of features in the dataset. On the other hand, decreasing the number of embeddings usually causes performance degradation, which is intolerable in CTR prediction. Finally, pruning embedding parameters…
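
The memory trade-off described in the abstract can be made concrete with a little arithmetic. The sketch below is only an illustration, not code from the paper or its repository: the field names, vocabulary sizes, dimensions, and helper functions are all hypothetical. It counts the parameters of a dense table (number of feature values times embedding dimension) and compares that with per-field dimensions and with dropping rows.

```python
# Hypothetical feature fields: name -> number of distinct feature values.
# These numbers are made up for illustration; they are not from the paper.
FIELD_SIZES = {"user_id": 1000, "item_id": 500, "device": 10}
FULL_DIM = 16  # assumed uniform embedding dimension


def full_table_params(field_sizes, dim):
    """Dense table: (total number of feature values) x (embedding dimension)."""
    return sum(field_sizes.values()) * dim


def per_field_dim_params(field_sizes, field_dims):
    """Every feature value keeps a row, but each field has its own dimension."""
    return sum(rows * field_dims[name] for name, rows in field_sizes.items())


def row_pruned_params(field_sizes, kept_rows, dim):
    """Each field keeps only some of its rows, all at the same dimension."""
    return sum(min(kept_rows[name], rows) * dim
               for name, rows in field_sizes.items())


if __name__ == "__main__":
    dims = {"user_id": 8, "item_id": 8, "device": 4}       # assumed
    kept = {"user_id": 400, "item_id": 300, "device": 10}  # assumed
    print("full table:      ", full_table_params(FIELD_SIZES, FULL_DIM))
    print("per-field dims:  ", per_field_dim_params(FIELD_SIZES, dims))
    print("row pruning only:", row_pruned_params(FIELD_SIZES, kept, FULL_DIM))
```

With these made-up numbers the dense table has 24,160 parameters, per-field dimensions leave 12,040, and row pruning alone leaves 11,360. The point is only that the two axes of the table can be shrunk independently; how OptEmbed chooses which rows and dimensions to keep is not shown here.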


Code & Models

Repositories

fuyuanlyu/optembed (PyTorch, official)


Taxonomy

Methods: Pruning · Balanced Selection