VertCoHiRF: Decentralized Vertical Clustering Beyond k-means
Bruno Belucci, Karim Lounici, Vladimir R. Kostic, Katia Meziani

TL;DR
VertCoHiRF introduces a fully decentralized vertical federated clustering framework that achieves hierarchical consensus without sharing feature-dependent data, enhancing privacy, robustness, and interpretability in multi-party settings.
Contribution
It proposes a novel decentralized clustering method that relies on identifier-level consensus and ordinal rankings, avoiding centralized coordination and feature sharing.
Findings
Competitive clustering performance demonstrated in experiments.
Supports overlapping feature partitions and heterogeneous local methods.
Provides an interpretable hierarchical clustering structure.
Abstract
Vertical Federated Learning (VFL) enables collaborative analysis across parties holding complementary feature views of the same samples, yet existing approaches are largely restricted to distributed variants of -means, requiring centralized coordination or the exchange of feature-dependent numerical statistics, and exhibiting limited robustness under heterogeneous views or adversarial behavior. We introduce VertCoHiRF, a fully decentralized framework for vertical federated clustering based on structural consensus across heterogeneous views, allowing each agent to apply a base clustering method adapted to its local feature space in a peer-to-peer manner. Rather than exchanging feature-dependent statistics or relying on noise injection for privacy, agents cluster their local views independently and reconcile their proposals through identifier-level consensus. Consensus is achieved via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · Data-Driven Disease Surveillance
