Selective inference after convex clustering with $\ell_1$ penalization
Fran\c{c}ois Bachoc, Cathy Maugis-Rabusseau, Pierre Neuvial

TL;DR
This paper develops a selective inference framework for convex clustering with penalization, providing statistical guarantees and efficient algorithms for testing differences in cluster means.
Contribution
It introduces a polyhedral characterization for clustering with penalization, enabling valid statistical tests and efficient computation in both one-dimensional and multi-dimensional settings.
Findings
Validated statistical guarantees through numerical experiments
Demonstrated power to detect mean differences between clusters
Implemented methods in the R package poclin
Abstract
Classical inference methods notoriously fail when applied to data-driven test hypotheses or inference targets. Instead, dedicated methodologies are required to obtain statistical guarantees for these selective inference problems. Selective inference is particularly relevant post-clustering, typically when testing a difference in mean between two clusters. In this paper, we address convex clustering with penalization, by leveraging related selective inference tools for regression, based on Gaussian vectors conditioned to polyhedral sets. In the one-dimensional case, we prove a polyhedral characterization of obtaining given clusters, than enables us to suggest a test procedure with statistical guarantees. This characterization also allows us to provide a computationally efficient regularization path algorithm. Then, we extend the above test procedure and guarantees to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference
