What Is the Cost of Energy Monitoring? An Empirical Study on the Overhead of RAPL-Based Tools
Jeremy Diamond, Vincenzo Stoico

TL;DR
This study quantifies the overhead caused by RAPL-based energy monitoring tools at high sampling rates and proposes strategies to reduce this overhead for more accurate energy profiling.
Contribution
It provides an empirical analysis of RAPL tool overhead and introduces optimized methods to minimize measurement impact at high sampling frequencies.
Findings
Existing tools can cause up to 46.75% time overhead at 1 kHz.
Our tools significantly reduce system call and math overhead.
Accessing RAPL values with lower-level instructions improves measurement speed.
Abstract
The Running Average Power Limit (RAPL) interface is widely used to estimate software energy consumption via CPU and DRAM counters, but tool design differences and high-frequency polling can introduce measurement overhead, namely, extra time and energy consumed by the tool itself.This paper quantifies the impact of RAPL-based tools on high-frequency (1 kHz) energy monitoring and investigates mitigation strategies. We conduct two controlled experiments: the first evaluates seven tools, including a user-space application and a kernel module developed by the authors, against a no-tool baseline, using six NAS Benchmark functions to quantify overhead. The second experiment isolates and times key functions for polling Model-Specific Registers (MSRs) (rdmsr and sys/proc_read) to estimate their execution latencies and identify potential slowdowns. The results show that existing user-space tools…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
