PIFS-Rec: Process-In-Fabric-Switch for Large-Scale Recommendation System Inferences
Pingyi Huo, Anusha Devulapally, Hasan Al Maruf, Minseo Park,, Krishnakumar Nair, Meena Arunachalam, Gulsum Gudukbay Akbulut, Mahmut Taylan, Kandemir, Vijaykrishnan Narayanan

TL;DR
This paper introduces PIFS-Rec, a novel process-in-fabric-switch approach that significantly accelerates large-scale deep learning recommendation models on CXL-enabled systems by reducing latency and improving scalability.
Contribution
The paper proposes PIFS-Rec, a new PIFS-based scheme implementing near-data processing to optimize DLRM performance on CXL systems, outperforming existing solutions.
Findings
PIFS-Rec reduces latency by 3.89x compared to Pond.
PIFS-Rec outperforms BEACON by 2.03x.
Characterization of industry-scale DLRM workloads on CXL systems.
Abstract
Deep Learning Recommendation Models (DLRMs) have become increasingly popular and prevalent in today's datacenters, consuming most of the AI inference cycles. The performance of DLRMs is heavily influenced by available bandwidth due to their large vector sizes in embedding tables and concurrent accesses. To achieve substantial improvements over existing solutions, novel approaches towards DLRM optimization are needed, especially, in the context of emerging interconnect technologies like CXL. This study delves into exploring CXL-enabled systems, implementing a process-in-fabric-switch (PIFS) solution to accelerate DLRMs while optimizing their memory and bandwidth scalability. We present an in-depth characterization of industry-scale DLRM workloads running on CXL-ready systems, identifying the predominant bottlenecks in existing CXL systems. We, therefore, propose PIFS-Rec, a PIFS-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques
