Improved Algorithms for Kernel Matrix-Vector Multiplication Under Sparsity Assumptions
Piotr Indyk, Michael Kapralov, Kshiteej Sheth, Tal Wagner

TL;DR
This paper introduces new subquadratic algorithms for fast matrix-vector multiplication of Gaussian kernel matrices, leveraging sparsity assumptions validated in practical attention matrix scenarios.
Contribution
It presents the first subquadratic algorithm for Gaussian kernel matrices under a sparsity-based assumption, applicable to unrestricted vectors.
Findings
Algorithms achieve subquadratic runtime in n
Validation of sparsity assumption in real-world matrices
Applicable to fast attention in large language models
Abstract
Motivated by the problem of fast processing of attention matrices, we study fast algorithms for computing matrix-vector products for asymmetric Gaussian Kernel matrices . 's columns are indexed by a set of keys , rows by a set of queries , and its entry is for some bandwidth parameter . Given a vector and error parameter , our task is to output a such that in time subquadratic in and linear in . Our algorithms rely on the following modelling assumption about the matrices : the sum of the entries of scales linearly in , as opposed to worst case quadratic growth. We validate this assumption experimentally, for Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques · Brain Tumor Detection and Classification
