Consistent spectral clustering in sparse tensor block models
Ian V\"alimaa, Lasse Leskel\"a

TL;DR
This paper develops a spectral clustering method for sparse tensor block models, providing theoretical guarantees for consistency and analyzing the effects of data aggregation in high-dimensional, sparse settings.
Contribution
It introduces a new tensor block model for sparse data, proposes a spectral clustering algorithm with a trimming step, and establishes a density threshold for consistency.
Findings
The proposed method is consistent under certain sparsity conditions.
A new concentration bound for sparse random Gram matrices is derived.
Numerical experiments validate the theoretical results.
Abstract
High-order clustering aims to classify objects in multiway datasets that are prevalent in various fields such as bioinformatics, recommendation systems, and social network analysis. Such data are often sparse and high-dimensional, posing significant statistical and computational challenges. This paper introduces a tensor block model specifically designed for sparse integer-valued data tensors. We propose a simple spectral clustering algorithm augmented with a trimming step to mitigate noise fluctuations, and identify a density threshold that ensures the algorithm's consistency. Our approach models sparsity using a sub-Poisson noise concentration framework, accommodating heavier than sub-Gaussian tails. Remarkably, this natural class of tensor block models is closed under aggregation across arbitrary modes. Consequently, we obtain a comprehensive framework for evaluating the tradeoff…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Functional Brain Connectivity Studies · Tensor decomposition and applications
MethodsSpectral Clustering
