PowerTrip: Exploiting Federated Heterogeneous Datacenter Power for Distributed ML Training
Talha Mehboob, Luanzheng Guo, Nathan Tallent, Michael Zink, David Irwin

TL;DR
PowerTrip is a system that dynamically selects geographically distributed data center sites for large-scale AI training, optimizing power use and communication to significantly reduce training time.
Contribution
It introduces a novel runtime site selection method based on power and network latency, addressing heterogeneity in power availability for distributed ML training.
Findings
Reduces training time-to-accuracy by up to 50%.
Effectively balances power availability and communication costs.
Demonstrates benefits using real-world power traces.
Abstract
The exponential growth of large-scale AI models has led to computational and power demands that can exceed the capacity of a single data center. This is due to the limited power supplied by regional grids that leads to limited regional computational power. Consequently, distributing training workloads across geographically distributed sites has become essential. However, this approach introduces a significant challenge in the form of communication overhead, creating a fundamental trade-off between the performance gains from accessing greater aggregate power and the performance losses from increased network latency. Although prior work has focused on reducing communication volume or using heuristics for distribution, these methods assume constant homogeneous power supplies and ignore the challenge of heterogeneous power availability between sites. To address the challenge of training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Data Storage Technologies · Advanced Neural Network Applications
