Improving I/O Performance for Exascale Applications through Online Data Layout Reorganization
Lipeng Wan, Axel Huebl, Junmin Gu, Franz Poeschel, Ana Gainaru, Ruonan, Wang, Jieyang Chen, Xin Liang, Dmitry Ganyushin, Todd Munson, Ian Foster,, Jean-Luc Vay, Norbert Podhorszki, Kesheng Wu, Scott Klasky

TL;DR
This paper presents online data layout reorganization techniques to improve I/O performance for Exascale applications, demonstrating significant read speed improvements in particle-in-cell simulations.
Contribution
It introduces two novel online data layout reorganization methods tailored for Exascale I/O challenges, enhancing read/write efficiency for dynamic, irregular data.
Findings
Read performance increased by over 80%
Effective data layout design improves I/O efficiency
Applicable to a broad class of Exascale applications
Abstract
The applications being developed within the U.S. Exascale Computing Project (ECP) to run on imminent Exascale computers will generate scientific results with unprecedented fidelity and record turn-around time. Many of these codes are based on particle-mesh methods and use advanced algorithms, especially dynamic load-balancing and mesh-refinement, to achieve high performance on Exascale machines. Yet, as such algorithms improve parallel application efficiency, they raise new challenges for I/O logic due to their irregular and dynamic data distributions. Thus, while the enormous data rates of Exascale simulations already challenge existing file system write strategies, the need for efficient read and processing of generated data introduces additional constraints on the data layout strategies that can be used when writing data to secondary storage. We review these I/O challenges and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
