Congestion Management in High-Performance Interconnection Networks Using Adaptive Routing Notifications
Jose Rocher-Gonzalez, Jesus Escudero-Sahuquillo, Pedro J. Garcia,, Francisco J. Quiles

TL;DR
This paper introduces a novel congestion management strategy for high-performance interconnection networks that uses adaptive routing notifications to effectively isolate and mitigate congestion, improving overall network performance.
Contribution
The paper proposes a new congestion management approach leveraging existing adaptive routing notifications to better isolate congesting flows in high-performance networks.
Findings
Reduces congestion spreading in network simulations
Improves network throughput and latency
Effectively isolates congesting flows
Abstract
The interconnection network is a crucial subsystem in High-Performance Computing clusters and Data-centers, guaranteeing high bandwidth and low latency to the applications' communication operations. Unfortunately, congestion situations may spoil network performance unless the network design applies specific countermeasures. Adaptive routing algorithms are a traditional approach to dealing with congestion since they provide traffic flows with alternative routes that bypass congested areas. However, adaptive routing decisions at switches are typically based on local information without a global network traffic perspective, leading to congestion spreading throughout the network beyond the original congested areas. In this paper, we propose a new efficient congestion management strategy that leverages adaptive routing notifications currently available in some interconnect technologies and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
