Proceedings of the 2015 International Workshop on the Lustre Ecosystem: Challenges and Opportunities
Neena Imam, Michael Brim, Sarp Oral

TL;DR
This paper discusses the challenges and opportunities in enhancing the Lustre parallel file system to support diverse workloads in high-performance and Big Data computing environments, emphasizing recent developments and future directions.
Contribution
It introduces the inaugural workshop focused on exploring improvements in Lustre's performance and flexibility for diverse applications and workloads.
Findings
Lustre is widely used in top supercomputers for high-performance storage.
The workshop initiated discussions on open challenges and technological advances.
Diverse workloads demand new features and enhancements in Lustre.
Abstract
The Lustre parallel file system has been widely adopted by high-performance computing (HPC) centers as an effective system for managing large-scale storage resources. Lustre achieves unprecedented aggregate performance by parallelizing I/O over file system clients and storage targets at extreme scales. Today, 7 out of 10 fastest supercomputers in the world use Lustre for high-performance storage. To date, Lustre development has focused on improving the performance and scalability of large-scale scientific workloads. In particular, large-scale checkpoint storage and retrieval, which is characterized by bursty I/O from coordinated parallel clients, has been the primary driver of Lustre development over the last decade. With the advent of extreme scale computing and Big Data computing, many HPC centers are seeing increased user interest in running diverse workloads that place new demands…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
