# Performance Modeling and Prediction for Dense Linear Algebra

**Authors:** Elmar Peise (HPAC, RWTH Aachen)

arXiv: 1706.01341 · 2017-06-06

## TL;DR

This dissertation develops measurement-based performance models for dense linear algebra algorithms that predict execution times without running the algorithms, enabling quick selection of optimal configurations.

## Contribution

It introduces automatic, platform-specific performance models for kernel operations and cache-aware micro-benchmarks for tensor contractions, improving prediction accuracy and efficiency.

## Key findings

- Accurately predicts the fastest algorithms and configurations.
- Enables rapid performance estimation without executing algorithms.
- Identifies optimal tensor traversal and kernel combinations efficiently.

## Abstract

This dissertation introduces measurement-based performance modeling and prediction techniques for dense linear algebra algorithms. As a core principle, these techniques avoid executions of such algorithms entirely, and instead predict their performance through runtime estimates for the underlying compute kernels. For a variety of operations, these predictions allow to quickly select the fastest algorithm configurations from available alternatives. We consider two scenarios that cover a wide range of computations:   To predict the performance of blocked algorithms, we design algorithm-independent performance models for kernel operations that are generated automatically once per platform. For various matrix operations, instantaneous predictions based on such models both accurately identify the fastest algorithm, and select a near-optimal block size.   For performance predictions of BLAS-based tensor contractions, we propose cache-aware micro-benchmarks that take advantage of the highly regular structure inherent to contraction algorithms. At merely a fraction of a contraction's runtime, predictions based on such micro-benchmarks identify the fastest combination of tensor traversal and compute kernel.

## Figures

40 figures with captions in the complete paper: https://tomesphere.com/paper/1706.01341/full.md

---
Source: https://tomesphere.com/paper/1706.01341