Model-Based Clustering of Functional Data Via Random Projection Ensembles
Matteo Mori, Laura Anderlucci

TL;DR
This paper introduces a novel clustering framework for functional data using random projections and ensemble consensus, improving stability and robustness without predefining the number of clusters.
Contribution
It proposes a new ensemble-based clustering method leveraging random projections and Gaussian Mixture Models, with automatic cluster number determination.
Findings
Effective in simulation studies
Successful application to spectroscopy data
Accurate clustering of speech log-periodograms
Abstract
Clustering functional data is a challenging task due to intrinsic infinite-dimensionality and the need for stable, data-adaptive partitioning. In this work, we propose a clustering framework based on Random Projections, which simultaneously performs dimensionality reduction and generates multiple stochastic representations of the original functions. Each projection is clustered independently, and the resulting partitions are then aggregated through an ensemble consensus procedure, enhancing robustness and mitigating the influence of any single projection. To focus on the most informative representations, projections are ranked according to clustering quality criteria, and only a selected subset is retained. In particular, we adopt Gaussian Mixture Models as base clusterers and employ the Kullback-Leibler divergence to order the random projections; these choices enable fast computation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Neural Networks and Applications
