Learning Process Energy Profiles from Node-Level Power Data
By: Jonathan Bader , Julius Irion , Jannis Kappel and more
Potential Business Impact:
Tracks computer energy use by each program.
The growing demand for data center capacity, driven by the growth of high-performance computing, cloud computing, and especially artificial intelligence, has led to a sharp increase in data center energy consumption. To improve energy efficiency, gaining process-level insights into energy consumption is essential. While node-level energy consumption data can be directly measured with hardware such as power meters, existing mechanisms for estimating per-process energy usage, such as Intel RAPL, are limited to specific hardware and provide only coarse-grained, domain-level measurements. Our proposed approach models per-process energy profiles by leveraging fine-grained process-level resource metrics collected via eBPF and perf, which are synchronized with node-level energy measurements obtained from an attached power distribution unit. By statistically learning the relationship between process-level resource usage and node-level energy consumption through a regression-based model, our approach enables more fine-grained per-process energy predictions.
Similar Papers
Strategies to Measure Energy Consumption Using RAPL During Workflow Execution on Commodity Clusters
Distributed, Parallel, and Cluster Computing
Measures how much power science computer jobs use.
Fine-Grained Energy Prediction For Parallellized LLM Inference With PIE-P
Distributed, Parallel, and Cluster Computing
Predicts computer energy use for faster AI.
Runtime Energy Monitoring for RISC-V Soft-Cores
Hardware Architecture
Measures computer energy use without complex math.