FLOPs as a Discriminant for Dense Linear Algebra Algorithms
Francisco L\'opez, Lars Karlsson, Paolo Bientinesi

TL;DR
This paper investigates whether FLOP counts reliably predict the fastest dense linear algebra algorithms, finding that anomalies exist and FLOPs alone are insufficient for optimal algorithm selection.
Contribution
The study demonstrates that FLOP counts are unreliable discriminants for dense linear algebra algorithms and highlights the need for combined performance models.
Findings
Anomalies in FLOP-based algorithm selection exist and cluster in regions.
FLOP count is not a dependable predictor of algorithm performance.
Most anomalies persist even after filtering cache effects.
Abstract
Expressions that involve matrices and vectors, known as linear algebra expressions, are commonly evaluated through a sequence of invocations to highly optimised kernels provided in libraries such as BLAS and LAPACK. A sequence of kernels represents an algorithm, and in general, because of associativity, algebraic identities, and multiple kernels, one expression can be evaluated via many different algorithms. These algorithms are all mathematically equivalent (i.e., in exact arithmetic, they all compute the same result), but often differ noticeably in terms of execution time. When faced with a decision, high-level languages, libraries, and tools such as Julia, Armadillo, and Linnea choose by selecting the algorithm that minimises the FLOP count. In this paper, we test the validity of the FLOP count as a discriminant for dense linear algebra algorithms, analysing "anomalies": problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
