Communication efficient application of sequences of planar rotations to   a matrix

Thijs Steel; Julien Langou

arXiv:2412.01852·cs.PF·December 4, 2024

Communication efficient application of sequences of planar rotations to a matrix

Thijs Steel, Julien Langou

PDF

Open Access

TL;DR

This paper introduces a highly efficient algorithm for applying sequences of planar rotations to matrices, optimizing for register reuse, cache efficiency, and providing thorough analysis to enhance performance in numerical linear algebra tasks.

Contribution

The paper presents a novel algorithm with optimized kernel, blocking scheme, and detailed memory analysis, outperforming existing methods in applying planar rotations.

Findings

01

Achieves near-peak flop rate on modern hardware

02

Outperforms state-of-the-art algorithms in efficiency

03

Provides theoretical insights into memory operations

Abstract

We present an efficient algorithm for the application of sequences of planar rotations to a matrix. Applying such sequences efficiently is important in many numerical linear algebra algorithms for eigenvalues. Our algorithm is novel in three main ways. First, we introduce a new kernel that is optimized for register reuse in a novel way. Second, we introduce a blocking and packing scheme that improves the cache efficiency of the algorithm. Finally, we thoroughly analyze the memory operations of the algorithm which leads to important theoretical insights and makes it easier to select good parameters. Numerical experiments show that our algorithm outperforms the state-of-the-art and achieves a flop rate close to the theoretical peak on modern hardware.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicsgraph theory and CDMA systems · Computational Geometry and Mesh Generation · Digital Image Processing Techniques