LLMapReduce: Multi-Level Map-Reduce for High Performance Data Analysis
Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill, Bergeron, Vijay Gadepally, Matthew Hubbell, Peter Michaleas, Julie Mullen,, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther

TL;DR
LLMapReduce enhances the traditional map-reduce model for supercomputers, simplifying parallel programming, supporting multiple languages, and overcoming scaling limits to improve performance significantly.
Contribution
It introduces a multi-level map-reduce framework that simplifies programming and enhances scalability on supercomputers without requiring application modifications.
Findings
Reduces computational overhead by over 10x for certain applications
Supports multiple schedulers like SLURM, Grid Engine, and LSF
Widely adopted at MIT by hundreds of users
Abstract
The map-reduce parallel programming model has become extremely popular in the big data community. Many big data workloads can benefit from the enhanced performance offered by supercomputers. LLMapReduce provides the familiar map-reduce parallel programming model to big data users running on a supercomputer. LLMapReduce dramatically simplifies map-reduce programming by providing simple parallel programming capability in one line of code. LLMapReduce supports all programming languages and many schedulers. LLMapReduce can work with any application without the need to modify the application. Furthermore, LLMapReduce can overcome scaling limits in the map-reduce parallel programming model via options that allow the user to switch to the more efficient single-program-multiple-data (SPMD) parallel programming model. These features allow users to reduce the computational overhead by more than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
