Generative Design for Direct-to-Chip Liquid Cooling for Data Centers
Zheng Liu

TL;DR
This paper introduces a generative design framework that optimizes cooling channel geometries for data center chips, significantly reducing hot spots and improving thermal management efficiency.
Contribution
It presents a physics-based generative design method that automatically creates cooling channels tailored to heterogeneous chip temperature distributions.
Findings
Achieved over 5°C reduction in average chip temperature.
Reduced maximum temperature by more than 35°C compared to baseline.
Demonstrated the effectiveness of physics-informed generative design in thermal management.
Abstract
Rapid growth in artificial intelligence (AI) workloads is driving up data center power densities, increasing the need for advanced thermal management. Direct-to-chip liquid cooling can remove heat efficiently at the source, but many cold plate channel layouts remain heuristic and are not optimized for the strongly non-uniform temperature distribution of modern heterogeneous packages. This work presents a generative design framework for synthesizing cooling channel geometries for the NVIDIA GB200 Grace Blackwell Superchip. A physics-based finite-difference thermal model provides rapid steady-state temperature predictions and supplies spatial thermal feedback to a constrained reaction-diffusion process that generates novel channel topologies while enforcing inlet/outlet and component constraints. By iterating channel generation and thermal evaluation in a closed loop, the method naturally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
