Design and Development of a Java Parallel I/O Library
Muhammad Sohaib Ayub, Muhammad Adnan, Muhammad Yasir Shafi

TL;DR
This paper introduces a Java parallel I/O library inspired by MPI-IO, aiming to improve I/O performance in Java-based HPC applications, which currently lack standardized, high-performance solutions.
Contribution
Development of a Java parallel I/O API modeled after MPI-IO, including a reference implementation and performance evaluation against existing C/C++ libraries.
Findings
Prototype Java I/O library developed
Performance benchmarks planned for comparison
Addresses gap in Java HPC I/O solutions
Abstract
Parallel I/O refers to the ability of scientific programs to concurrently read/write from/to a single file from multiple processes executing on distributed memory platforms like compute clusters. In the HPC world, I/O becomes a significant bottleneck for many real-world scientific applications. In the last two decades, there has been significant research in improving the performance of I/O operations in scientific computing for traditional languages including C, C++, and Fortran. As a result of this, several mature and high-performance libraries including ROMIO (implementation of MPI-IO), parallel HDF5, Parallel I/O (PIO), and parallel netCDF are available today that provide efficient I/O for scientific applications. However, there is very little research done to evaluate and improve I/O performance of Java-based HPC applications. The main hindrance in the development of efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Scientific Computing and Data Management
