RFP: A Remote Fetching Paradigm for RDMA-Accelerated Systems
Maomeng Su, Mingxing Zhang, Kang Chen, Yongwei Wu, and Guoliang Li

TL;DR
This paper introduces the Remote Fetching Paradigm (RFP) for RDMA systems, leveraging the asymmetry in RDMA-read/write speeds to significantly enhance IOPS in in-memory key-value stores.
Contribution
The paper proposes a novel RDMA communication paradigm that improves performance by using in-bound RDMA-read instead of out-bound RDMA-write, with implementation and optimization for key-value stores.
Findings
RFP achieves 160%-310% higher IOPS than existing models.
In-bound RDMA-read is 5 times faster than out-bound RDMA-write.
Optimization combining status checking and result fetching further enhances performance.
Abstract
Remote Direct Memory Access (RDMA) is an efficient way to improve the performance of traditional client-server systems. Currently, there are two main design paradigms for RDMA-accelerated systems. The first allows the clients to directly operate the server's memory and totally bypasses the CPUs at server side. The second follows the traditional server-reply paradigm, which asks the server to write results back to the clients. However, the first method has to expose server's memory and needs tremendous re-design of upper-layer software, which is complex, unsafe, error-prone, and inefficient. The second cannot achieve high input/output operations per second (IOPS), because it employs out-bound RDMA-write at server side which is not efficient. We find that the performance of out-bound RDMA-write and in-bound RDMA-read is asymmetric and the latter is 5 times faster than the former. Based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed and Parallel Computing Systems · Distributed systems and fault tolerance
