Emulating a large memory with a collection of small ones
James Hanlon

TL;DR
This paper shows that large-memory sequential programs can be efficiently emulated on highly-parallel architectures with minimal slowdown, enabling a smooth transition from traditional sequential systems to parallel ones.
Contribution
It introduces a scalable architecture that emulates large memories using collections of smaller ones, achieving only 2-3 times slowdown for sequential programs.
Findings
Achieves 2-3x slowdown in emulating large memory on parallel architectures.
Supports efficient execution of sequential programs on parallel systems.
Facilitates transition from sequential to parallel computing models.
Abstract
Sequential computation is well understood but does not scale well with current technology. Within the next decade, systems will contain large numbers of processors with potentially thousands of processors per chip. Despite this, many computational problems exhibit little or no parallelism and many existing formulations are sequential. It is therefore essential that highly-parallel architectures can support sequential computation by emulating large memories with collections of smaller ones, thus supporting efficient execution of sequential programs or sequential components of parallel programs. This paper demonstrates that a realistic parallel architecture with scalable low-latency communications can execute large-memory sequential programs with a factor of only 2 to 3 slowdown, when compared to a conventional sequential architecture. This overhead seems an acceptable price to pay to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Distributed systems and fault tolerance
