Elastic Data Transfer Optimization with Hybrid Reinforcement Learning
Rasman Mubtasim Swargo, Md Arifuzzaman

TL;DR
This paper introduces ame, an adaptive data transfer system that uses hybrid reinforcement learning to optimize multiple parameters simultaneously, significantly improving throughput for large scientific data transfers.
Contribution
It presents a novel hybrid reinforcement learning approach with a lightweight simulator for efficient training, outperforming existing methods in data transfer throughput.
Findings
Achieves up to 9.5x higher throughput than state-of-the-art methods.
Reduces training time to less than four minutes with a 2750x speedup.
Effectively optimizes multiple transfer parameters simultaneously.
Abstract
Modern scientific data acquisition generates petabytes of data that must be transferred to geographically distant computing clusters. Conventional tools either rely on preconfigured sessions, which are difficult to tune for users without domain expertise, or they adaptively optimize only concurrency while ignoring other important parameters. We present \name, an adaptive data transfer method that jointly considers multiple parameters. Our solution incorporates heuristic-based parallelism, infinite pipelining, and a deep reinforcement learning based concurrency optimizer. To make agent training practical, we introduce a lightweight network simulator that reduces training time to less than four minutes and provides a speedup compared to online training. Experimental evaluation shows that \name consistently outperforms existing methods across diverse datasets, achieving up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Stochastic Gradient Optimization Techniques · Cloud Computing and Resource Management
