Modeling and Controlling Many-Core HPC Processors: an Alternative to PID and Moving Average Algorithms
Giovanni Bambini, Alessandro Ottaviano, Christian Conficoni, Andrea, Tilli, Luca Benini, Andrea Bartolini

TL;DR
This paper develops a detailed thermal and power model for HPC MPSoCs, analyzing coupling effects, and proposes a fuzzy control-based thermal capping strategy that outperforms PID controllers in managing temperature and power.
Contribution
It introduces a comprehensive thermal and power model considering real-world coupling effects and proposes a novel fuzzy control strategy to improve thermal management in HPC MPSoCs.
Findings
Up to 5x reduction in maximum temperature exceedance
Average 3.56% faster application runtime
Effective control of thermal and power coupling effects
Abstract
The race towards performance increase and computing power has led to chips with heterogeneous and complex designs, integrating an ever-growing number of cores on the same monolithic chip or chiplet silicon die. Higher integration density, compounded with the slowdown of technology-driven power reduction, implies that power and thermal management become increasingly relevant. Unfortunately, existing research lacks a detailed analysis and modeling of thermal, power, and electrical coupling effects and how they have to be jointly considered to perform dynamic control of complex and heterogeneous Multi-Processor System on Chips (MPSoCs). To close the gap, in this work, we first provide a detailed thermal and power model targeting a modern High Performance Computing (HPC) MPSoC. We consider real-world coupling effects such as actuators' non-idealities and the exponential relation between the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Distributed and Parallel Computing Systems
