Joint Energy Management and Coordinated AIGC Workload Scheduling for Distributed Data Centers: A Diffusion-Aided Reward Shaping Approach
Yang Fu, Peng Qin, Liming Chen, Zihao Zhang, Hao Yu, Yifei Wang

TL;DR
This paper presents a novel framework for energy-efficient AIGC workload scheduling in distributed data centers, integrating diffusion models with reinforcement learning to handle reward sparsity and model heterogeneity.
Contribution
It introduces a diffusion-aided reward shaping method within a joint energy management and workload scheduling framework, addressing challenges of reward sparsity and model heterogeneity.
Findings
The proposed method improves convergence speed of reinforcement learning algorithms.
It effectively balances energy costs and service quality in real-world data center scenarios.
The approach outperforms benchmark methods in system utility and adaptability.
Abstract
Artificial intelligence-generated content (AIGC) has emerged as a transformative paradigm for automating the creation of diverse and customized content, giving rise to rapidly growing computational workloads in cloud data centers. It is imperative for AIGC service providers (ASPs) to strategically schedule AIGC workloads to reduce data center energy costs while guaranteeing high-quality content generation. However, the distinctive characteristics of AIGC services pose critical challenges, including model heterogeneity across ASPs, implicit service quality evaluation, and complex inference process control. To tackle these challenges, we propose a joint energy management and coordinated AIGC workload scheduling framework, which introduces an explicit mathematical characterization of service quality to promote both job transfer among ASPs and fine-grained inference process configuration.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
