Implementation of PFC and RCM for RoCEv2 Simulation in OMNeT++
Qian Liu, Robert D. Russell, Fabrice Mizero, Malathi Veeraraghavan, John Dennis, Benjamin Jamroz

TL;DR
This paper presents an OMNeT++ implementation of Priority-based Flow Control (PFC) and RoCEv2 Congestion Management (RCM), used to simulate and analyze congestion control in complex data center and supercomputer networks with RoCEv2 interconnects.
Contribution
It introduces a simulation framework for PFC and RCM in OMNeT++, enabling detailed analysis of congestion control in RoCEv2 networks.
Findings
Simulation helps identify congestion points and sources.
Simulation can be used to determine optimal congestion control settings for reducing congestion.
Framework supports complex network topology analysis.
Abstract
As traffic patterns and network topologies become more and more complicated in current enterprise data centers and TOP500 supercomputers, the probability of network congestion increases. If no countermeasures are taken, network congestion causes long communication delays and degrades network performance. A congestion control mechanism is often provided to reduce the consequences of congestion. However, it is usually difficult to configure and activate a congestion control mechanism in production clusters and supercomputers due to concerns that it may negatively impact jobs if the mechanism is not appropriately configured. Therefore, simulations for these situations are necessary to identify congestion points and sources, and more importantly, to determine optimal settings that can be utilized to reduce congestion in those complicated networks. In this paper, we use OMNeT++ to implement…
Taxonomy
Topics: Interconnection Networks and Systems · Parallel Computing and Optimization Techniques · Embedded Systems Design Techniques
