Attention-based workload prediction and dynamic resource allocation for heterogeneous computing environments
Shijia Shao, Xinyi Ding, Biao Zhao, Peiqing Ye

TL;DR
This paper introduces a smart system that predicts workloads and allocates resources efficiently in data centers with diverse computing hardware.
Contribution
The novel framework uses attention mechanisms for workload prediction and dynamic resource allocation in heterogeneous environments.
Findings
The system achieves 78.4% resource utilization with only 2.3% SLA violations.
It reduces average task completion time by 25.8% and energy consumption by 15.1%.
The framework improves infrastructure efficiency while maintaining service quality.
Abstract
The rapid proliferation of artificial intelligence applications in modern data centers demands intelligent resource management strategies that can effectively handle diverse workloads across heterogeneous computing infrastructures. This paper proposes an integrated framework that combines multi-head spatial-temporal attention mechanisms for workload prediction with dynamic resource allocation algorithms optimized for heterogeneous environments. The spatial-temporal attention architecture separately models temporal evolution patterns within individual workload streams and spatial correlations across concurrent task types, enabling accurate forecasting of resource demands. The allocation framework formulates resource assignment as a multi-objective optimization problem that jointly considers performance, energy efficiency, and utilization while explicitly accounting for prediction…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Big Data and Digital Economy · Distributed and Parallel Computing Systems
