Parametric Hierarchical Matrix Approximations to Kernel Matrices
Abraham Khan, Chao Chen, Vishwas Rao, Arvind K. Saibaba

TL;DR
This paper introduces parametric hierarchical matrices that efficiently approximate kernel matrices depending on hyperparameters, enabling rapid online computations and significant speedups over existing methods.
Contribution
The paper develops a new class of parametric hierarchical matrices with an offline-online construction, allowing efficient hyperparameter-dependent kernel matrix approximations.
Findings
Over 100x speedup in numerical experiments.
Efficient online stage with no new kernel evaluations.
Comparable offline costs to standard hierarchical matrix methods.
Abstract
Kernel matrices are ubiquitous in computational mathematics, often arising from applications in machine learning and scientific computing. In two or three spatial or feature dimensions, such problems can be approximated efficiently by a class of matrices known as hierarchical matrices. A hierarchical matrix consists of a hierarchy of small near-field blocks (or sub-matrices) stored in a dense format and large far-field blocks approximated by low-rank matrices. Standard methods for forming hierarchical matrices do not account for the fact that kernel matrices depend on specific hyperparameters; for example, in the context of Gaussian processes, hyperparameters must be optimized over a fixed parameter space. We introduce a new class of hierarchical matrices, namely, parametric (parameter-dependent) hierarchical matrices. Members of this new class are parametric -matrices and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Tensor decomposition and applications · Gaussian Processes and Bayesian Inference
