Architectural Support for Efficient Data Movement in Disaggregated Systems
Christina Giannoula, Kailong Huang, Jonathan Tang, Nectarios Koziris, Georgios Goumas, Zeshan Chishti, Nandita Vijaykumar

TL;DR
This paper presents DaeMon, a software-transparent mechanism that reduces data movement overheads in disaggregated systems by enhancing hardware with specialized engines and employing adaptive data migration strategies, improving performance and cost-efficiency.
Contribution
It introduces DaeMon, the first robust, software-transparent system for efficient data movement in disaggregated architectures, combining hardware enhancements and adaptive migration techniques.
Findings
Significantly improves system performance over page-granularity data movement.
Reduces data access costs in disaggregated systems.
Enhances scalability and robustness across various network conditions.
Abstract
Resource disaggregation offers a cost-effective solution to resource scaling, utilization, and failure-handling in data centers by physically separating hardware devices in a server. Servers are architected as pools of processor, memory, and storage devices, organized as independent failure-isolated components interconnected by a high-bandwidth network. A critical challenge, however, is the high performance penalty of accessing data from a remote memory module over the network. Addressing this challenge is difficult as disaggregated systems have high runtime variability in network latency and bandwidth, and page migration can significantly delay critical-path cache line accesses in other pages. This paper introduces DaeMon, the first software-transparent and robust mechanism to significantly alleviate data movement overheads in fully disaggregated systems. First, to enable scalability to…
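To make the granularity trade-off mentioned in the abstract concrete, here is a minimal back-of-the-envelope sketch. It is not DaeMon's mechanism or the paper's evaluation model. It only assumes a simple cost model (fixed link latency plus serialization delay), a 64-byte cache line, a 4 KB page, and illustrative link parameters, all of which are assumptions chosen for the example.

```python
# Illustrative cost model only: transfer time = fixed latency + size / bandwidth.
# The sizes and link parameters below are assumptions, not values from the paper.

CACHE_LINE_BYTES = 64   # assumed cache line size
PAGE_BYTES = 4096       # assumed page size


def transfer_time_us(size_bytes, link_latency_us, bandwidth_gbps):
    """Time in microseconds to move size_bytes over a link."""
    # Convert Gb/s to bytes per microsecond.
    bytes_per_us = bandwidth_gbps * 1e9 / 8 / 1e6
    return link_latency_us + size_bytes / bytes_per_us


def compare(link_latency_us, bandwidth_gbps):
    """Return (cache_line_time_us, page_time_us) for the same link."""
    line = transfer_time_us(CACHE_LINE_BYTES, link_latency_us, bandwidth_gbps)
    page = transfer_time_us(PAGE_BYTES, link_latency_us, bandwidth_gbps)
    return line, page


if __name__ == "__main__":
    # Hypothetical link: 1 us latency, 10 Gb/s bandwidth.
    line_us, page_us = compare(link_latency_us=1.0, bandwidth_gbps=10.0)
    print(f"cache line: {line_us:.4f} us, page: {page_us:.4f} us")
```

Under this simplified model, a request for a single cache line that has to wait behind a full page transfer on the same link sees the page's serialization delay added to its own. This is one way to read the abstract's point that page migration can delay critical-path cache line accesses.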
Taxonomy
Topics: Cloud Computing and Resource Management · Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
