CkIO: Parallel File Input for Over-Decomposed Task-Based Systems
Mathew Jacob, Maya Taylor, Laxmikant Kale

TL;DR
This paper introduces CkIO, a new parallel file input library for over-decomposed task-based systems like Charm++, addressing input performance challenges in large-scale applications through a novel abstraction that separates data consumers from file readers.
Contribution
The paper presents CkIO, a configurable input library that improves scalability, load balancing, and overlap of input with computation in over-decomposed systems, filling a gap in parallel I/O performance.
Findings
CkIO enables scalable input processing in over-decomposed systems.
It improves load balancing and data migration capabilities.
Preliminary performance data shows promising results.
Abstract
Parallel input performance issues are often neglected in large scale parallel applications in Computational Science and Engineering. Traditionally, there has been less focus on input performance because either input sizes are small (as in biomolecular simulations) or the time doing input is insignificant compared with the simulation with many timesteps. But newer applications, such as graph algorithms add a premium to file input performance. Additionally, over-decomposed systems, such as Charm++/AMPI, present new challenges in this context in comparison to MPI applications. In the over-decomposition model, naive parallel I/O in which every task makes its own I/O request is impractical. Furthermore, load balancing supported by models such as Charm++/AMPI precludes assumption of data contiguity on individual nodes. We develop a new I/O abstraction to address these issues by separating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management
