TL;DR
OptEmbed introduces a unified framework for learning optimal, compact embedding tables in CTR prediction models by pruning redundant features and efficiently searching for the best embedding dimensions, leading to improved performance.
Contribution
The paper proposes OptEmbed, a novel method that jointly prunes and searches for optimal embedding dimensions using a supernet and evolution search, surpassing existing methods in efficiency and effectiveness.
Findings
OptEmbed produces more compact embedding tables.
It improves CTR prediction performance.
It reduces memory usage significantly.
Abstract
Learning embedding table plays a fundamental role in Click-through rate(CTR) prediction from the view of the model performance and memory usage. The embedding table is a two-dimensional tensor, with its axes indicating the number of feature values and the embedding dimension, respectively. To learn an efficient and effective embedding table, recent works either assign various embedding dimensions for feature fields and reduce the number of embeddings respectively or mask the embedding table parameters. However, all these existing works cannot get an optimal embedding table. On the one hand, various embedding dimensions still require a large amount of memory due to the vast number of features in the dataset. On the other hand, decreasing the number of embeddings usually suffers from performance degradation, which is intolerable in CTR prediction. Finally, pruning embedding parameters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning · Balanced Selection
