On-Disk Data Processing: Issues and Future Directions
Mayank Mishra, Arun K. Somani

TL;DR
This paper surveys on-disk data processing (ODDP), a form of near-data processing in which storage drives perform computation on the data they hold. It covers proposed architectures, recent SSD-based advances, open challenges, and future research directions.
Contribution
It provides a comprehensive review of ODDP architectures, analyzes recent SSD-based solutions, and outlines future challenges and directions for on-disk data processing research.
Findings
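ODDP schemes vary widely in processing capability, target applications, architecture, and the kind of storage drive employed.
The advent of Solid State Drives has enabled more powerful and extensive ODDP solutions.
The survey identifies current and future challenges and the directions ODDP research can take.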
Abstract
In this paper, we present a survey of "on-disk" data processing (ODDP). ODDP, which is a form of near-data processing, refers to the computing arrangement where the secondary storage drives have data processing capability. Proposed ODDP schemes vary widely in terms of data processing capability, target applications, architecture, and the kind of storage drive employed. Some ODDP schemes provide only a specific but heavily used operation such as sort, whereas others provide a full range of operations. Recently, with the advent of Solid State Drives, powerful and extensive ODDP solutions have been proposed. In this paper, we present a thorough review of the architectures developed for different on-disk processing approaches, along with current and future challenges, and identify the future directions that ODDP can take.
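As a rough illustration of the arrangement the abstract describes, the toy sketch below contrasts a conventional read path, where every record crosses the drive-to-host link before the host filters it, with an ODDP-style path, where the drive applies the filter itself and ships only the matches. It is not taken from the paper: `ToyDrive`, `read_all`, `filter_on_disk`, and the 8-byte record size are all hypothetical names and values chosen for this example, and a Python list stands in for the drive's contents.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ToyDrive:
    """Hypothetical storage drive; a list stands in for on-disk records."""
    records: List[int]
    bytes_per_record: int = 8      # assumed record size, for accounting only
    bytes_sent_to_host: int = 0    # running total of link traffic

    def read_all(self) -> List[int]:
        # Conventional path: every record crosses the drive-to-host link.
        self.bytes_sent_to_host += len(self.records) * self.bytes_per_record
        return list(self.records)

    def filter_on_disk(self, predicate: Callable[[int], bool]) -> List[int]:
        # ODDP-style path: the drive evaluates the predicate itself
        # and transfers only the matching records.
        matches = [r for r in self.records if predicate(r)]
        self.bytes_sent_to_host += len(matches) * self.bytes_per_record
        return matches


def is_multiple_of_100(value: int) -> bool:
    return value % 100 == 0


# Host-side filtering: read everything, then filter on the host.
host_drive = ToyDrive(list(range(1000)))
host_result = [r for r in host_drive.read_all() if is_multiple_of_100(r)]

# On-disk filtering: the drive returns only the matches.
oddp_drive = ToyDrive(list(range(1000)))
oddp_result = oddp_drive.filter_on_disk(is_multiple_of_100)

print(host_drive.bytes_sent_to_host, oddp_drive.bytes_sent_to_host)
```

Both paths produce the same result; only the amount of data moved over the link differs. A single-operation scheme such as on-drive sort would follow the same pattern with a different method on the drive.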
Taxonomy
Topics: Cloud Computing and Resource Management · Advanced Data Storage Technologies · Scientific Computing and Data Management
