Learnings from an Under the Hood Analysis of an Object Storage Node IO Stack
Pratik Mishra, Rekha Pitchumani, Yang Suk Kee

TL;DR
This paper analyzes the inefficiencies in traditional OS-based object storage stacks, highlighting how complex data management policies hinder performance, and explores object-drives as a promising solution to reduce overheads and improve efficiency.
Contribution
It provides a comprehensive under-the-hood analysis of object-storage node I/O stacks, identifying performance bottlenecks and proposing object-drives as a means to simplify storage architectures.
Findings
Legacy OS I/O stacks cause significant performance overheads.
Object-drives can reduce data management overheads by 20-38%.
Complex data policies in OS stacks lead to underutilization of storage device capabilities.
Abstract
Conventional object-stores are built on top of traditional OS storage stack, where I/O requests typically transfers through multiple hefty and redundant layers. The complexity of object management has grown dramatically with the ever increasing requirements of performance, consistency and fault-tolerance from storage subsystems. Simply stated, more number of intermediate layers are encountered in the I/O data path, with each passing layer adding its own syntax and semantics. Thereby increasing the overheads of request processing. In this paper, through comprehensive under-the-hood analysis of an object-storage node, we characterize the impact of object-store (and user-application) workloads on the OS I/O stack and its subsequent rippling effect on the underlying object-storage devices (OSD). We observe that the legacy architecture of the OS based I/O storage stack coupled with complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
