RDMAbox : Optimizing RDMA for Memory Intensive Workloads
Juhyun Bae, Ling Liu, Yanzhao Wu, Gong Su, Arun Iyengar

TL;DR
RDMAbox introduces low-level RDMA optimizations with flexible libraries, significantly improving performance for memory-intensive workloads like remote paging and file systems in data centers.
Contribution
The paper presents novel RDMA request merging, chaining, and adaptive polling techniques that enhance throughput and latency, outperforming existing solutions.
Findings
Up to 4× throughput improvement in remote paging
Up to 83% reduction in tail latency for big data workloads
Up to 5.9× higher throughput in user space file systems
Abstract
We present RDMAbox, a set of low level RDMA optimizations that provide better performance than previous approaches. The optimizations are packaged in easy-to-use kernel and user space libraries for applications and systems in data center. We demonstrate the flexibility and effectiveness of RDMAbox by implementing a kernel remote paging system and a user space file system using RDMAbox. RDMAbox employs two optimization techniques. First, we suggest RDMA request merging and chaining to further reduce the total number of I/O operations to the RDMA NIC. The I/O merge queue at the same time functions as a traffic regulator to enforce admission control and avoid overloading the NIC. Second, we propose Adaptive Polling to achieve higher efficiency of polling Work Completion than existing busy polling while maintaining the low CPU overhead of event trigger. Our implementation of a remote paging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
