Algorithmic Energy Saving for Parallel Cholesky, LU, and QR Factorizations

Li Tan; Zizhong Chen

arXiv:1411.2536 · cs.DC · April 3, 2015

Open Access

TL;DR

This paper introduces TX, a library-level race-to-halt DVFS scheduling method that analyzes task dependencies in parallel Cholesky, LU, and QR factorizations, saving up to 17.8% more energy than OS-level solutions at an average performance loss of 3.5%.

Contribution

The paper presents a novel, application-aware, library-level DVFS scheduling approach for matrix factorizations that outperforms OS-level solutions in energy savings.

Findings

01

TX saves up to 17.8% more energy than OS-level solutions.

02

Performance loss is negligible, averaging 3.5%.

03

TX applies to parallel Cholesky, LU, and QR factorizations.

Abstract

The pressing demand for better energy efficiency in high-performance scientific computing has motivated a large body of software-controlled hardware solutions using Dynamic Voltage and Frequency Scaling (DVFS) that strategically switch processors to low-power states when peak processor performance is not necessary. Although OS-level solutions have demonstrated the effectiveness of saving energy in a black-box fashion, for applications with variable execution characteristics the optimal energy efficiency can be squandered due to defective prediction mechanisms and untapped load imbalance. In this paper, we propose TX, a library-level race-to-halt DVFS scheduling approach that analyzes the Task Dependency Set of each task in parallel Cholesky, LU, and QR factorizations to achieve substantial energy savings that OS-level solutions cannot deliver. Partially giving up the generality…

Taxonomy

Topics: Parallel Computing and Optimization Techniques · Interconnection Networks and Systems · Quantum Computing Algorithms and Architecture