Embedding Samples Dispatching for Recommendation Model Training in Edge Environments
Guopeng Li, Haisheng Tan, Chi Zhang, Hongqiu Ni, Zilong Wang, Xinyue Zhang, Yang Xu, Han Tian

TL;DR
This paper introduces ESD, a sample dispatching mechanism for edge-based deep learning recommendation models that significantly reduces transmission costs and accelerates training by optimizing sample distribution.
Contribution
The paper proposes ESD, a novel dispatching mechanism combining optimal and heuristic algorithms to minimize embedding transmission costs in edge environments.
Findings
ESD reduces embedding transmission costs by up to 36.76%.
ESD achieves up to 1.74x speedup in training.
Prototype experiments outperform state-of-the-art mechanisms.
Abstract
Training deep learning recommendation models (DLRMs) on edge workers brings several benefits, particularly in terms of data privacy protection, low latency and personalization. However, due to the huge size of embedding tables, typical DLRM training frameworks adopt one or more parameter servers to maintain global embedding tables, while leveraging the edge workers cache part of them. This incurs significant transmission cost for embedding transmissions between workers and parameter servers, which can dominate the training cycle. In this paper, we investigate how to dispatch input embedding samples to appropriate edge workers to minimize the total embedding transmission cost when facing edge-specific challenges such as heterogeneous networks and limited resources. We develop ESD, a novel mechanism that optimizes the dispatch of input embedding samples to edge workers based on expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
