FinGraV: Methodology for Fine-Grain GPU Power Visibility and Insights
Varsha Singhania, Shaizeen Aga, Mohamed Assem Ibrahim

TL;DR
FinGraV introduces a methodology for high-fidelity, fine-grain GPU power measurement tailored for AI workloads, enabling detailed insights into GPU power consumption and facilitating power optimization strategies.
Contribution
The paper presents FinGraV, a novel approach combining execution time binning, synchronization, and power differentiation to accurately profile GPU power at fine granularity.
Findings
Detailed GPU power profiles across AI workloads
Insights into GPU sub-component power consumption
Guidance on accurate power measurement techniques
Abstract
Ubiquity of AI makes optimizing GPU power a priority as large GPU-based clusters are often employed to train and serve AI models. An important first step in optimizing GPU power consumption is high-fidelity and fine-grain power measurement of key AI computations on GPUs. To this end, we observe that as GPUs get more powerful, the resulting sub-millisecond to millisecond executions make fine-grain power analysis challenging. In this work, we first carefully identify the challenges in obtaining fine-grain GPU power profiles. To address these challenges, we devise FinGraV methodology where we employ execution time binning, careful CPU-GPU time synchronization, and power profile differentiation to collect fine-grain GPU power profiles across prominent AI computations and across a spectrum of scenarios. Using the said FinGraV power profiles, we provide both, guidance on accurate power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSilicon and Solar Cell Technologies · Modular Robots and Swarm Intelligence · 3D IC and TSV technologies
