Stochastic Communication Avoidance for Recommendation Systems
Lutfi Eren Erdogan, Vijay Anand Raghava Kanakagiri, Kurt Keutzer, Zhen, Dong

TL;DR
This paper introduces a theoretical framework and algorithms to reduce communication costs in distributed recommendation systems with large embedding tables, significantly improving training throughput on GPUs.
Contribution
It presents a novel theoretical analysis and algorithms for communication-efficient distributed recommendation systems, addressing memory and communication bottlenecks.
Findings
Achieves up to 6x increase in training throughput on GPUs
Provides a theoretical framework for communication cost analysis
Demonstrates effectiveness across various datasets and memory constraints
Abstract
One of the major bottlenecks for efficient deployment of neural network based recommendation systems is the memory footprint of their embedding tables. Although many neural network based recommendation systems could benefit from the faster on-chip memory access and increased computational power of hardware accelerators, the large embedding tables in these models often cannot fit on the constrained memory of accelerators. Despite the pervasiveness of these models, prior methods in memory optimization and parallelism fail to address the memory and communication costs of large embedding tables on accelerators. As a result, the majority of models are trained on CPUs, while current implementations of accelerators are hindered by issues such as bottlenecks in inter-device communication and main memory lookups. In this paper, we propose a theoretical framework that analyses the communication…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Recommender Systems and Techniques · Cognitive Radio Networks and Spectrum Sensing
