RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
Maximilian Jakob Heer, Benjamin Ramhorst, Yu Zhu, Luhao Liu, Zhiyi Hu, Jonas Dann, Gustavo Alonso

TL;DR
RoCE BALBOA is an open-source, scalable RDMA stack compatible with RoCE v2, enabling customizable, high-performance network acceleration and smartNIC development for data centers, demonstrated through FPGA deployment and diverse use cases.
Contribution
The paper introduces RoCE BALBOA, a flexible, open-source RDMA implementation that supports high scalability and customization for smartNICs and accelerators in data centers.
Findings
Performance comparable to commercial NICs
Supports hundreds of queue-pairs at 100G
Enables new smartNIC and accelerator designs
Abstract
Data-intensive applications in data centers, especially machine learning (ML), have made the network a bottleneck, which in turn has motivated the development of more efficient network protocols and infrastructure. For instance, remote direct memory access (RDMA) has become the standard protocol for data transport in the cloud as it minimizes data copies and reduces CPU-utilization via host-bypassing. Similarly, an increasing amount of network functions and infrastructure have moved to accelerators, SmartNICs, and in-network computing to bypass the CPU. In this paper we explore the implementation and deployment of RoCE BALBOA, an open-source, RoCE v2-compatible, scalable up to hundreds of queue-pairs, and 100G-capable RDMA-stack that can be used as the basis for building accelerators and smartNICs. RoCE BALBOA is customizable, opening up a design space and offering a degree of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Software-Defined Networks and 5G · Network Packet Processing and Optimization
