Batch Self Organizing maps for distributional data using adaptive distances
Antonio Irpino, Francisco De Carvalho, Rosanna Verde, Antonio, Balzanella

TL;DR
This paper introduces an adaptive Batch Self Organizing Map algorithm for distributional data, automatically learning variable relevance weights and distribution components to improve clustering and analysis.
Contribution
It proposes a novel adaptive DBSOM algorithm that learns variable and component relevance weights for distributional data using Wasserstein distance.
Findings
Effective in real and synthetic datasets
Automatically learns relevance weights for variables and distribution components
Improves clustering quality for distributional data
Abstract
The paper deals with a Batch Self Organizing Map algorithm (DBSOM) for data described by distributional-valued variables. This kind of variables is characterized to take as values one-dimensional probability or frequency distributions on a numeric support. The objective function optimized in the algorithm depends on the choice of the distance measure. According to the nature of the date, the Wasserstein distance is proposed as one of the most suitable metrics to compare distributions. It is widely used in several contexts of analysis of distributional data. Conventional batch SOM algorithms consider that all variables are equally important for the training of the SOM. However, it is well known that some variables are less relevant than others for this task. In order to take into account the different contribution of the variables we propose an adaptive version of the DBSOM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Bayesian Methods and Mixture Models · Neural Networks and Applications
