Optimal Variable Clustering for High-Dimensional Matrix Valued Data
Inbeom Lee, Siyi Deng, Yang Ning

TL;DR
This paper introduces a new hierarchical clustering method for matrix-valued data that leverages dependence structures, providing theoretical guarantees and demonstrating superior performance over existing methods in high-dimensional settings.
Contribution
It proposes a novel latent variable model and a hierarchical clustering algorithm with theoretical consistency and optimal weight selection for high-dimensional matrix data.
Findings
Algorithm achieves clustering consistency under mild conditions.
Optimal weight guarantees minimax rate-optimality.
Outperforms existing methods in simulation studies.
Abstract
Matrix valued data has become increasingly prevalent in many applications. Most of the existing clustering methods for this type of data are tailored to the mean model and do not account for the dependence structure of the features, which can be very informative, especially in high-dimensional settings or when mean information is not available. To extract the information from the dependence structure for clustering, we propose a new latent variable model for the features arranged in matrix form, with some unknown membership matrices representing the clusters for the rows and columns. Under this model, we further propose a class of hierarchical clustering algorithms using the difference of a weighted covariance matrix as the dissimilarity measure. Theoretically, we show that under mild conditions, our algorithm attains clustering consistency in the high-dimensional setting. While this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Face and Expression Recognition · Complex Network Analysis Techniques
