Software-Distributed Shared Memory for Heterogeneous Machines: Design and Use Considerations
Lo\"ic Cudennec (DACLE-LIST, DGA.MI)

TL;DR
This paper presents SAT, a software-DSM system designed for heterogeneous microserver architectures, featuring automatic data decomposition, multiple coherence protocols, and energy-efficient event-driven programming.
Contribution
It introduces SAT, a novel software-DSM tailored for heterogeneous architectures, with automatic chunking, multi-protocol management, and energy-saving mechanisms.
Findings
SAT enables efficient shared data management on microservers.
The system supports multiple coherence protocols for flexibility.
Energy consumption is reduced through a micro-sleep mechanism.
Abstract
Distributed shared memory (DSM) allows to implement and deploy applications onto distributed architectures using the convenient shared memory programming model in which a set of tasks are able to allocate and access data despite their remote localization. With the development of distributed heterogeneous architectures in both HPC and embedded contexts, there is a renewal of interest for systems such as DSM that ease the programmability of complex hardware. In this report, some design considerations are given to build a complete software-DSM (S-DSM). This S-DSM called SAT (Share Among Things) is developed at CEA (the French Alternative Energies and Atomic Energy Commission) within the framework of European project M2DC (Modular Microserver DataCentre) to tackle the problem of managing shared data over microserver architec-tures. The S-DSM features the automatic decomposition of large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Parallel Computing and Optimization Techniques · Distributed systems and fault tolerance
