Demystifying the Performance of Data Transfers in High-Performance Research Networks
Ehsan Saeedizade, Bing Zhang, Engin Arslan

TL;DR
This paper introduces a scalable monitoring framework for high-performance research networks that captures detailed transfer metrics and identifies performance issues, helping optimize data transfer efficiency.
Contribution
The paper presents a novel end-to-end monitoring system capable of analyzing thousands of transfers simultaneously and a heuristic method for root cause analysis of performance anomalies.
Findings
Monitors up to 400 transfers per host and over 40,000 transfers overall.
Collects performance metrics with one-second precision.
Achieves 87-98% F-score in identifying root causes of anomalies.
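As a point of reference for the F-score figure above, here is a minimal sketch of how such a score is computed. It assumes the reported F-score is the balanced F1 (the harmonic mean of precision and recall), which this summary does not confirm. The function name `f_score` and the example counts are hypothetical and are not taken from the paper.

```python
def f_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Balanced F-score (F1): harmonic mean of precision and recall."""
    if true_positives == 0:
        # No correct detections: precision and recall are both zero (or undefined).
        return 0.0
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for one root-cause class (illustrative only, not from the paper).
score = f_score(true_positives=90, false_positives=10, false_negatives=10)
print(f"{score:.2f}")
```

A high F-score requires both few false alarms (precision) and few missed anomalies (recall), so the metric penalizes a detector that does well on only one of the two.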
Abstract
High-speed research networks are built to meet the ever-increasing needs of data-intensive distributed workflows. However, data transfers in these networks often fail to attain the promised transfer rates for several reasons, including I/O and network interference, server misconfigurations, and network anomalies. Although understanding the root causes of performance issues is critical to mitigating them and increasing the utilization of expensive network infrastructures, there is currently no available mechanism to monitor data transfers in these networks. In this paper, we present a scalable, end-to-end monitoring framework to gather and store key performance metrics for file transfers to shed light on the performance of transfers. The evaluation results show that the proposed framework can monitor up to 400 transfers per host and more than 40,000 transfers in total while collecting…
Taxonomy
Topics: Software System Performance and Reliability · Cloud Computing and Resource Management · Advanced MRI Techniques and Applications
