Dynamic Clustering of Histogram Data Based on Adaptive Squared   Wasserstein Distances

Antonio Irpino; Rosanna Verde; Francisco de AT De Carvalho

arXiv:1110.1462·math.ST·May 3, 2016

Dynamic Clustering of Histogram Data Based on Adaptive Squared Wasserstein Distances

Antonio Irpino, Rosanna Verde, Francisco de AT De Carvalho

PDF

TL;DR

This paper introduces a dynamic clustering method for histogram data using adaptive squared Wasserstein distances, enabling better interpretation of complex distributional data in various applications.

Contribution

It proposes a novel adaptive squared Wasserstein distance-based clustering algorithm tailored for histogram data, with interpretative tools and validation on synthetic and real datasets.

Findings

01

Effective clustering of histogram data demonstrated on real-world datasets

02

Adaptive distances improve cluster interpretability

03

Method outperforms traditional clustering approaches

Abstract

This paper deals with clustering methods based on adaptive distances for histogram data using a dynamic clustering algorithm. Histogram data describes individuals in terms of empirical distributions. These kind of data can be considered as complex descriptions of phenomena observed on complex objects: images, groups of individuals, spatial or temporal variant data, results of queries, environmental data, and so on. The Wasserstein distance is used to compare two histograms. The Wasserstein distance between histograms is constituted by two components: the first based on the means, and the second, to internal dispersions (standard deviation, skewness, kurtosis, and so on) of the histograms. To cluster sets of histogram data, we propose to use Dynamic Clustering Algorithm, (based on adaptive squared Wasserstein distances) that is a k-means-like algorithm for clustering a set of individuals…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.