Wattchmen: Watching the Wattchers -- High Fidelity, Flexible GPU Energy Modeling
Brandon Tran, Matthias Maiterth, Woong Shin, Matthew D. Sinclair, and Shivaram Venkataraman

TL;DR
Wattchmen is a flexible, instruction-level GPU energy modeling methodology that improves accuracy and applicability across architectures, enabling detailed energy analysis and reductions in high-performance computing workloads.
Contribution
It introduces a per-instruction energy model for GPUs that outperforms existing systems in accuracy and extends to multiple architectures, providing detailed energy insights.
Findings
Wattchmen achieves a mean absolute percent error of 14% on V100 GPUs.
It maintains similar accuracy on water-cooled V100s and extends to A100 and H100 architectures.
Applying Wattchmen enables energy reductions of up to 35% in applications like Backprop and QMCPACK.
Abstract
Modern GPU-rich HPC systems are increasingly becoming energy-constrained. Thus, understanding an application's energy consumption becomes essential. Unfortunately, current GPU energy attribution techniques are either inaccurate, inflexible, or outdated. Therefore, we propose Wattchmen, a flexible methodology for measuring, attributing, and predicting GPU energy consumption. We construct a per-instruction energy model using a diverse set of microbenchmarks to systematically quantify the energy consumption of GPU instructions, enabling finer-grain prediction and energy consumption breakdowns for applications. Compared with the state-of-the-art systems like AccelWattch (32%) and Guser (25%), across 16 popular GPGPU, graph analytics, HPC, and ML workloads, Wattchmen reduces the mean absolute percent error (MAPE) to 14% on V100 GPUs. Furthermore, we show that Wattchmen provides similar MAPEs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
