A Physics-Aware Framework for Short-Term GPU Power Forecasting of AI Data Centers
Mohammad AlShaikh Saleh, Sanjay Chawla, Sertac Bayhan, Haitham Abu-Rub, and Ali Ghrayeb

TL;DR
This paper introduces PI-DLinear, a physics-informed time-series model that accurately forecasts short-term GPU power usage in AI data centers by integrating thermal physics with power consumption data.
Contribution
The paper presents the first physics-informed DLinear model for short-term GPU power forecasting, improving accuracy and physical consistency over existing models.
Findings
PI-DLinear outperforms state-of-the-art models in accuracy metrics.
The model respects physical laws during load transients and power throttling.
Forecasting accuracy improvements range from 0.782% to 51.82%.
Abstract
AI data centers experience rapid fluctuations in power demand due to the heterogeneity of computational tasks that they have to support. For example, the power profile of inference and training of large language models (LLMs) is quite distinct and big divergences can result in the instability of the underlying electricity grid. In this paper we propose, to the best of our knowledge, the first physics-informed DLinear time-series model that can accurately forecast power utilization of an AI data center 5-80 minutes (short-term forecasting) into the future. The physics, based on a multi-node lumped thermal resistance-capacitance (RC) network consistent with Newton's law of cooling, is captured using newly derived time-dependent ordinary differential equations (ODE) that separately models and interlinks power consumption with the GPU compute and memory utilization and temperature. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
