Roomy: A System for Space Limited Computations
Daniel Kunkle

TL;DR
Roomy is a C/C++ library that lets large symbolic algebra computations use the local disks of a cluster, or a SAN, as a transparent extension of RAM, so that problems too large for the cluster's distributed RAM can still be solved.
Contribution
Roomy is a minimally invasive system for adapting the code of an existing computation so that cluster disk storage serves as an extension of RAM, lifting the memory limit that otherwise caps the size of solvable symbolic algebra problems.
Findings
Enables handling of problems larger than distributed RAM
Provides simple data structures and parallel constructs
Hides parallelism and I/O complexities within the library
Abstract
There are numerous examples of problems in symbolic algebra in which the required storage grows far beyond the limitations even of the distributed RAM of a cluster. Often this limitation determines how large a problem one can solve in practice. Roomy provides a minimally invasive system to modify the code for such a computation, in order to use the local disks of a cluster or a SAN as a transparent extension of RAM. Roomy is implemented as a C/C++ library. It provides some simple data structures (arrays, unordered lists, and hash tables). Some typical programming constructs that one might employ in Roomy are: map, reduce, duplicate elimination, chain reduction, pair reduction, and breadth-first search. All aspects of parallelism and remote I/O are hidden within the Roomy library.
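To make the idea of disk as an extension of RAM concrete, here is a minimal sketch of one of the constructs named above, duplicate elimination, done on disk. It is not Roomy's API or its algorithm: it is written in Python for brevity, runs on a single machine, and the function name `dedupe_on_disk` and the parameter `num_buckets` are hypothetical names chosen for this illustration. The assumption is that the full data set does not fit in RAM but any one hash bucket does.

```python
import os
import pickle
import tempfile


def dedupe_on_disk(items, num_buckets=4, workdir=None):
    """Yield each distinct item once, holding only one bucket in RAM at a time.

    Illustrative sketch only; the name and parameters are hypothetical
    and do not come from the Roomy library.
    """
    with tempfile.TemporaryDirectory(dir=workdir) as tmp:
        paths = [os.path.join(tmp, f"bucket_{i}.bin") for i in range(num_buckets)]
        files = [open(p, "wb") for p in paths]
        try:
            # Pass 1: stream every item to the bucket file chosen by its hash.
            # Equal items have equal hashes, so they land in the same bucket.
            for item in items:
                pickle.dump(item, files[hash(item) % num_buckets])
        finally:
            for f in files:
                f.close()

        # Pass 2: load one bucket at a time and drop duplicates in RAM.
        for path in paths:
            seen = set()
            with open(path, "rb") as f:
                while True:
                    try:
                        item = pickle.load(f)
                    except EOFError:
                        break  # end of this bucket file
                    if item not in seen:
                        seen.add(item)
                        yield item


if __name__ == "__main__":
    distinct = list(dedupe_on_disk([3, 1, 3, 2, 1, 2, 5], num_buckets=3))
    print(sorted(distinct))
```

Results come out in bucket order rather than input order, which is why a construct like this pairs naturally with an unordered list. Both passes read and write the files sequentially, the access pattern that disks handle best.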
Taxonomy
Topics: Distributed and Parallel Computing Systems · Advanced Data Storage Technologies · Distributed systems and fault tolerance
