The Mathematics Behind Spectral Clustering And The Equivalence To PCA
T Shen

TL;DR
This paper explains the mathematical foundations of spectral clustering, clarifies its connection to PCA, and distinguishes its behavior based on graph connectivity, providing theoretical insights into its operation.
Contribution
It offers a detailed mathematical explanation of spectral clustering, including its equivalence to PCA and the effects of graph connectivity on eigenvectors.
Findings
Spectral clustering can be understood through an objective function based on data similarities.
For fully connected graphs, the first eigenvectors relate to data covariance.
In multi-connected graphs, eigenvectors indicate connected components.
Abstract
Spectral clustering is a popular algorithm that clusters points using the eigenvalues and eigenvectors of Laplacian matrices derived from the data. For years, spectral clustering has been working mysteriously. This paper explains spectral clustering by dividing it into two categories based on whether the graph Laplacian is fully connected or not. For a fully connected graph, this paper demonstrates the dimension reduction part by offering an objective function: the covariance between the original data points' similarities and the mapped data points' similarities. For a multi-connected graph, this paper proves that with a proper , the first eigenvectors are the indicators of the connected components. This paper also proves there is an equivalence between spectral embedding and PCA.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Face and Expression Recognition · Image Retrieval and Classification Techniques
MethodsSpectral Clustering · Principal Components Analysis
