Learning Effective and Efficient Embedding via an Adaptively-Masked   Twins-based Layer

Bencheng Yan; Pengjie Wang; Kai Zhang; Wei Lin; Kuang-Chih Lee; Jian; Xu; Bo Zheng

arXiv:2108.11513·cs.LG·August 27, 2021·1 cites

Learning Effective and Efficient Embedding via an Adaptively-Masked Twins-based Layer

Bencheng Yan, Pengjie Wang, Kai Zhang, Wei Lin, Kuang-Chih Lee, Jian, Xu, Bo Zheng

PDF

Open Access

TL;DR

This paper introduces an adaptively-masked twins-based layer (AMTL) for embedding size selection in deep learning recommendation models, improving efficiency and flexibility while reducing memory usage by 60%.

Contribution

It proposes a novel AMTL layer that adaptively masks embedding dimensions, enabling flexible and memory-efficient embedding size selection in DLRMs.

Findings

01

Outperforms baseline methods on benchmark tasks.

02

Reduces memory usage by 60%.

03

Maintains performance metrics with improved efficiency.

Abstract

Embedding learning for categorical features is crucial for the deep learning-based recommendation models (DLRMs). Each feature value is mapped to an embedding vector via an embedding learning process. Conventional methods configure a fixed and uniform embedding size to all feature values from the same feature field. However, such a configuration is not only sub-optimal for embedding learning but also memory costly. Existing methods that attempt to resolve these problems, either rule-based or neural architecture search (NAS)-based, need extensive efforts on the human design or network training. They are also not flexible in embedding size selection or in warm-start-based applications. In this paper, we propose a novel and effective embedding size selection scheme. Specifically, we design an Adaptively-Masked Twins-based Layer (AMTL) behind the standard embedding layer. AMTL generates a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Text and Document Classification Technologies · Advanced Graph Neural Networks