In-storage Processing of I/O Intensive Applications on Computational Storage Drives
Ali HeydariGorji, Mahdi Torabzadehkashi, Siavash Rezaei, Hossein, Bobarshad, Vladimir Alves, Pai H. Chou

TL;DR
This paper introduces Solana, a high-capacity computational storage drive that enables in-storage processing, significantly improving performance and energy efficiency for data analytics and NLP applications.
Contribution
It presents the first high-capacity CSD prototype, demonstrating practical benefits of in-storage processing for big-data and NLP workloads.
Findings
Up to 3.1x speedup in processing time
67% reduction in energy consumption
68% less data transfer needed
Abstract
Computational storage drives (CSD) are solid-state drives (SSD) empowered by general-purpose processors that can perform in-storage processing. They have the potential to improve both performance and energy significantly for big-data analytics by bringing compute to data, thereby eliminating costly data transfer while offering better privacy. In this work, we introduce Solana, the first-ever high-capacity(12-TB) CSD in E1.S form factor, and present an actual prototype for evaluation. To demonstrate the benefits of in-storage processing on CSD, we deploy several natural language processing (NLP) applications on datacenter-grade storage servers comprised of clusters of the Solana. Experimental results show up to 3.1x speedup in processing while reducing the energy consumption and data transfer by 67% and 68%, respectively, compared to regular enterprise SSDs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems
