Data-Driven Linear Complexity Low-Rank Approximation of General Kernel Matrices: A Geometric Approach
Difeng Cai, Edmond Chow, Yuanzhe Xi

TL;DR
This paper introduces a geometric approach for efficiently approximating large, arbitrarily distributed kernel matrices with low rank, crucial for scalable kernel methods like Gaussian process regression.
Contribution
It proposes a novel geometric method for selecting point subsets to construct low-rank kernel approximations, avoiding matrix formation and enabling near-linear scaling.
Findings
Method achieves linear or near-linear complexity for large datasets.
Guidelines for geometric point selection improve approximation quality.
Applicable to high-dimensional, arbitrarily distributed data sets.
Abstract
A general, {\em rectangular} kernel matrix may be defined as where is a kernel function and where and are two sets of points. In this paper, we seek a low-rank approximation to a kernel matrix where the sets of points and are large and are arbitrarily distributed, such as away from each other, ``intermingled'', identical, etc. Such rectangular kernel matrices may arise, for example, in Gaussian process regression where corresponds to the training data and corresponds to the test data. In this case, the points are often high-dimensional. Since the point sets are large, we must exploit the fact that the matrix arises from a kernel function, and avoid forming the matrix, and thus ruling out most algebraic techniques. In particular, we seek methods that can scale linearly or nearly linear with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical and numerical algorithms · Face and Expression Recognition
MethodsTest · Gaussian Process
