Dynamic Clustering of Histogram Data Based on Adaptive Squared Wasserstein Distances
Antonio Irpino, Rosanna Verde, Francisco de AT De Carvalho

TL;DR
This paper introduces a dynamic clustering method for histogram data using adaptive squared Wasserstein distances, enabling better interpretation of complex distributional data in various applications.
Contribution
It proposes a novel adaptive squared Wasserstein distance-based clustering algorithm tailored for histogram data, with interpretative tools and validation on synthetic and real datasets.
Findings
Effective clustering of histogram data demonstrated on real-world datasets
Adaptive distances improve cluster interpretability
Method outperforms traditional clustering approaches
Abstract
This paper deals with clustering methods based on adaptive distances for histogram data using a dynamic clustering algorithm. Histogram data describes individuals in terms of empirical distributions. These kind of data can be considered as complex descriptions of phenomena observed on complex objects: images, groups of individuals, spatial or temporal variant data, results of queries, environmental data, and so on. The Wasserstein distance is used to compare two histograms. The Wasserstein distance between histograms is constituted by two components: the first based on the means, and the second, to internal dispersions (standard deviation, skewness, kurtosis, and so on) of the histograms. To cluster sets of histogram data, we propose to use Dynamic Clustering Algorithm, (based on adaptive squared Wasserstein distances) that is a k-means-like algorithm for clustering a set of individuals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
