Post-clustering Inference under Dependence
Javier Gonz\'alez-Delgado, Mathis Deronzier, Juan Cort\'es, Pierre Neuvial

TL;DR
This paper extends post-clustering inference methods to handle arbitrary dependence structures in data, broadening their applicability beyond the independent Gaussian case, and demonstrates their effectiveness on synthetic and real datasets.
Contribution
It develops a theoretical framework for post-clustering inference under dependence, applicable to hierarchical clustering and k-means, with conditions for covariance estimation.
Findings
Framework extended to dependent data structures
Conditions established for covariance matrix estimation
Validated on synthetic and protein structure data
Abstract
Recent work by Gao et al. (JASA 2022) has laid the foundations for post-clustering inference, establishing a theoretical framework allowing to test for differences between means of estimated clusters. Additionally, they studied the estimation of unknown parameters while controlling the selective type I error. However, their theory was developed for independent observations identically distributed as -dimensional Gaussian variables, where the parameter estimation could only be performed for spherical covariance matrices. Here, we aim at extending this framework to a more convenient scenario for practical applications, where arbitrary dependence structures between observations and features are allowed. We establish sufficient conditions for extending the setting presented by Gao et al. to the general dependence framework. Moreover, we assess theoretical conditions allowing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Gene expression and cancer classification
