Selective inference after convex clustering with $\ell_1$ penalization

Fran\c{c}ois Bachoc; Cathy Maugis-Rabusseau; Pierre Neuvial

arXiv:2309.01492·stat.ME·November 11, 2025

Selective inference after convex clustering with $\ell_1$ penalization

Fran\c{c}ois Bachoc, Cathy Maugis-Rabusseau, Pierre Neuvial

PDF

Open Access

TL;DR

This paper develops a selective inference framework for convex clustering with penalization, providing statistical guarantees and efficient algorithms for testing differences in cluster means.

Contribution

It introduces a polyhedral characterization for clustering with penalization, enabling valid statistical tests and efficient computation in both one-dimensional and multi-dimensional settings.

Findings

01

Validated statistical guarantees through numerical experiments

02

Demonstrated power to detect mean differences between clusters

03

Implemented methods in the R package poclin

Abstract

Classical inference methods notoriously fail when applied to data-driven test hypotheses or inference targets. Instead, dedicated methodologies are required to obtain statistical guarantees for these selective inference problems. Selective inference is particularly relevant post-clustering, typically when testing a difference in mean between two clusters. In this paper, we address convex clustering with $ℓ_{1}$ penalization, by leveraging related selective inference tools for regression, based on Gaussian vectors conditioned to polyhedral sets. In the one-dimensional case, we prove a polyhedral characterization of obtaining given clusters, than enables us to suggest a test procedure with statistical guarantees. This characterization also allows us to provide a computationally efficient regularization path algorithm. Then, we extend the above test procedure and guarantees to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference