Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction
Tesi Xiao, Xia Xiao, Ming Chen, Youlong Chen

TL;DR
This paper introduces a novel, efficient gradient-based method for searching optimal embedding sizes in CTR prediction models by structurally pruning a super-net with auxiliary masks, improving efficiency and reducing redundancy.
Contribution
It proposes a new discrete search strategy for embedding sizes using structural pruning and auxiliary masks, addressing computational costs and search space issues in NAS-based methods.
Findings
Effectively removes redundant embedding dimensions
Maintains competitive prediction performance
Reduces computational costs during search
Abstract
Feature embeddings are one of the most essential steps when training deep learning based Click-Through Rate prediction models, which map high-dimensional sparse features to dense embedding vectors. Classic human-crafted embedding size selection methods are shown to be "sub-optimal" in terms of the trade-off between memory usage and model capacity. The trending methods in Neural Architecture Search (NAS) have demonstrated their efficiency to search for embedding sizes. However, most existing NAS-based works suffer from expensive computational costs, the curse of dimensionality of the search space, and the discrepancy between continuous search space and discrete candidate space. Other works that prune embeddings in an unstructured manner fail to reduce the computational costs explicitly. In this paper, to address those limitations, we propose a novel strategy that searches for the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning in Materials Science · Advanced Neural Network Applications
MethodsPruning
