AdaS&S: a One-Shot Supernet Approach for Automatic Embedding Size Search in Deep Recommender System
He Wei, Yuekui Yang, Yang Zhang, Haiyang Wu, Meixi Liu, Shaoping Ma

TL;DR
AdaS&S introduces a one-shot supernet framework with adaptive sampling and reinforcement learning to automatically search for optimal embedding sizes in deep recommender systems, improving performance and reducing memory usage.
Contribution
It proposes a novel one-shot search framework with adaptive sampling and resource-aware reinforcement learning for stable and efficient embedding size optimization.
Findings
Improves AUC by about 0.3% on public datasets.
Reduces embedding parameters by approximately 20%.
Achieves more stable search results than previous methods.
Abstract
Deep Learning Recommendation Model(DLRM)s utilize the embedding layer to represent various categorical features. Traditional DLRMs adopt unified embedding size for all features, leading to suboptimal performance and redundant parameters. Thus, lots of Automatic Embedding size Search (AES) works focus on obtaining mixed embedding sizes with strong model performance. However, previous AES works can hardly address several challenges together: (1) The search results of embedding sizes are unstable; (2) Recommendation effect with AES results is unsatisfactory; (3) Memory cost of embeddings is uncontrollable. To address these challenges, we propose a novel one-shot AES framework called AdaS&S, in which a supernet encompassing various candidate embeddings is built and AES is performed as searching network architectures within it. Our framework contains two main stages: In the first stage, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Recommender Systems and Techniques · Sentiment Analysis and Opinion Mining
MethodsFocus · ADaptive gradient method with the OPTimal convergence rate
