Improving Energy Saving of One-sided Matrix Decompositions on CPU-GPU Heterogeneous Systems
Jieyang Chen, Xin Liang, Kai Zhao, Hadi Zamani Sabzi, Laxmi Bhuyan,, Zizhong Chen

TL;DR
This paper introduces BSR, a framework combining fault tolerance overclocking and DVFS to significantly reduce energy consumption in matrix decompositions on CPU-GPU systems while maintaining performance.
Contribution
The paper proposes BSR, an energy-saving framework that integrates ABFT-OC overclocking and DVFS for efficient matrix decompositions on heterogeneous systems.
Findings
BSR saves up to 11.7% more energy than existing methods.
BSR achieves up to 14.1% Energy * Delay^2 reduction.
BSR enables performance improvements up to 1.43x without extra energy cost.
Abstract
One-sided dense matrix decompositions (e.g., Cholesky, LU, and QR) are the key components in scientific computing in many different fields. Although their design has been highly optimized for modern processors, they still consume a considerable amount of energy. As CPU-GPU heterogeneous systems are commonly used for matrix decompositions, in this work, we aim to further improve the energy saving of one-sided matrix decompositions on CPU-GPU heterogeneous systems. We first build an Algorithm-Based Fault Tolerance protected overclocking technique (ABFT-OC) to enable us to exploit reliable overclocking for key matrix decomposition operations. Then, we design an energy-saving matrix decomposition framework, Bi-directional Slack Reclamation(BSR), that can intelligently combine the capability provided by ABFT-OC and DVFS to maximize energy saving and maintain performance and reliability.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
