Distance preserving model order reduction of graph-Laplacians and cluster analysis
Vladimir Druskin, Alexander V. Mamonov, Mikhail Zaslavsky

TL;DR
This paper introduces novel algorithms for graph-Laplacian dimension reduction focusing on a target subset of nodes, enabling efficient spectral clustering that preserves global distances and reduces computational costs for large datasets.
Contribution
The work develops reduced-order models that accurately approximate the diffusion transfer function of graph-Laplacians for target subsets, improving efficiency in spectral clustering tasks.
Findings
Reduces Krylov subspace dimension for target subsets
Ensures clustering consistency with full data spectral clustering
Enables efficient clustering with smaller computational resources
Abstract
Graph-Laplacians and their spectral embeddings play an important role in multiple areas of machine learning. This paper is focused on graph-Laplacian dimension reduction for the spectral clustering of data as a primary application. Spectral embedding provides a low-dimensional parametrization of the data manifold which makes the subsequent task (e.g., clustering) much easier. However, despite reducing the dimensionality of data, the overall computational cost may still be prohibitive for large data sets due to two factors. First, computing the partial eigendecomposition of the graph-Laplacian typically requires a large Krylov subspace. Second, after the spectral embedding is complete, one still has to operate with the same number of data points. For example, clustering of the embedded data is typically performed with various relaxations of k-means which computational cost scales poorly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSpectral Clustering
