RecFlash: Fast Recommendation System on In-Storage Computing with Frequency-Based Data Mapping
Jangho Baik, Sunghyun Kim, Gisan Ji, Wonbo Shim, Sungju Ryu

TL;DR
RecFlash is a novel in-storage computing accelerator that uses frequency-based data remapping to significantly improve the latency and energy efficiency of recommendation system inference on NAND flash memory.
Contribution
It introduces a data remapping algorithm tailored for recommendation systems to optimize NAND flash-based in-storage computing performance.
Findings
Latency improved by up to 81%
Energy consumption reduced by up to 91.9%
Outperforms existing NAND flash-based ISC architectures
Abstract
Recommendation system has gained a large popularity for a variety of personalized suggestion tasks, but the ever-increasing number of user data makes real-time processing of recommendation systems difficult. NAND flash memory-based in-storage computing scheme can be one of favorable candidates among the various acceleration approaches because the flash memory typically has a larger memory capacity than the other memory types, so it can efficiently handle a large amount of user data for the recommendation inference services. However, different from other neural network applications where data is sequentially fetched from memory, the recommendation system shows the irregular random memory access pattern. Hence, most of the data loaded from the NAND flash array to the page buffer are not used, so a large portion of the internal bandwidth is underutilized, which degrades the performance on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
