Co-Design Optimization for Data Center Cooling System via Digital Twin
Shrenik Jadhav, Zheng Liu

TL;DR
This paper introduces a three-layer optimization framework for efficient coolant distribution in liquid-cooled supercomputers, achieving significant energy savings and demonstrating the effectiveness of flow fraction optimization.
Contribution
It develops a systematic co-design optimization approach for coolant distribution and flow control in large-scale data center cooling systems.
Findings
Global optimal design reduces energy use by 35.48%.
Flow fraction optimization significantly improves performance.
Framework is adaptable to other liquid-cooled data centers.
Abstract
Liquid-cooled exascale supercomputers dissipate heat through cooling plants organized as multiple parallel subloops, but how to allocate coolant distribution units (CDUs) across subloops and how to distribute flow among them has not been systematically addressed for facilities at this scale. This paper presents a three-layer optimization framework that jointly determines the integer partition of CDUs across subloops, the continuous flow fraction allocation, and the per-timestep co-design optimization of total flow rate and supply temperature subject to per-subloop thermal safety constraints. The Modelica simulation model is built based on the data of Frontier exascale supercomputer at Oak Ridge National Laboratory. By developing a reduced-order surrogate model, all 611 feasible partitions of 25 CDUs are evaluated across the full year operational dataset of 49,353 timesteps. Three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
