DaeMon: Architectural Support for Efficient Data Movement in Disaggregated Systems
Christina Giannoula, Kailong Huang, Jonathan Tang, Nectarios Koziris,, Georgios Goumas, Zeshan Chishti, Nandita Vijaykumar

TL;DR
DaeMon is a software-transparent system that significantly reduces data movement overheads in disaggregated data center architectures by employing specialized engines and adaptive data transfer techniques, improving performance and access costs.
Contribution
It introduces DaeMon, the first mechanism to transparently optimize data movement in disaggregated systems, enhancing scalability and performance across diverse workloads.
Findings
DaeMon achieves 2.39× performance improvement over page-granularity data movement.
DaeMon reduces data access costs by 3.06×.
The approach effectively handles high variability in network latency and bandwidth.
Abstract
Resource disaggregation offers a cost effective solution to resource scaling, utilization, and failure-handling in data centers by physically separating hardware devices in a server. Servers are architected as pools of processor, memory, and storage devices, organized as independent failure-isolated components interconnected by a high-bandwidth network. A critical challenge, however, is the high performance penalty of accessing data from a remote memory module over the network. Addressing this challenge is difficult as disaggregated systems have high runtime variability in network latencies/bandwidth, and page migration can significantly delay critical path cache line accesses in other pages. This paper conducts a characterization analysis on different data movement strategies in fully disaggregated systems, evaluates their performance overheads in a variety of workloads, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · IoT and Edge/Fog Computing
