PyPOD-GP: Using PyTorch for Accelerated Chip-Level Thermal Simulation of the GPU
Neil He, Ming-Cheng Cheng, Yu Liu

TL;DR
PyPOD-GP introduces a GPU-optimized PyTorch library for chip-level GPU thermal simulation, achieving significant speedups over traditional MPI-based methods while maintaining high accuracy.
Contribution
The paper presents PyPOD-GP, a novel GPU-accelerated library using PyTorch for efficient chip-level thermal simulation with high accuracy.
Findings
Over 23.4x faster training speed
Over 10x faster inference speed
Only 1.2% error compared to device layer
Abstract
The rising demand for high-performance computing (HPC) has made full-chip dynamic thermal simulation in many-core GPUs critical for optimizing performance and extending device lifespans. Proper orthogonal decomposition (POD) with Galerkin projection (GP) has shown to offer high accuracy and massive runtime improvements over direct numerical simulation (DNS). However, previous implementations of POD-GP use MPI-based libraries like PETSc and FEniCS and face significant runtime bottlenecks. We propose a Torch-based library (PyPOD-GP), a GPU-optimized library for chip-level thermal simulation. PyPOD-GP achieves over speedup in training and over speedup in inference on a GPU with over 13,000 cores, with just error over the device layer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Heat Transfer and Optimization · Embedded Systems Design Techniques
