Using an expert deviation carrying the knowledge of climate data in usual clustering algorithms
Emmanuel Biabiany, Vincent Page, Didier Bernard, H\'el\`ene, Paugam-Moisy

TL;DR
This paper introduces a novel dissimilarity measure called Expert Deviation (ED) based on climate knowledge, improving clustering analysis of wind and rainfall data for better physical interpretability of atmospheric structures.
Contribution
The paper proposes replacing Euclidean distance with Expert Deviation in clustering algorithms, enhancing the physical relevance and interpretability of climate data clusters.
Findings
ED-based clustering yields more physically meaningful clusters.
KMS-ED discriminates daily climate situations better than KMS-L2.
Clusters identified with ED align with atmospheric structures.
Abstract
In order to help physicists to expand their knowledge of the climate in the Lesser Antilles, we aim to identify the spatio-temporal configurations using clustering analysis on wind speed and cumulative rainfall datasets. But we show that using the L2 norm in conventional clustering methods as K-Means (KMS) and Hierarchical Agglomerative Clustering (HAC) can induce undesirable effects. So, we propose to replace Euclidean distance (L2) by a dissimilarity measure named Expert Deviation (ED). Based on the symmetrized Kullback-Leibler divergence, the ED integrates the properties of the observed physical parameters and climate knowledge. This measure helps comparing histograms of four patches, corresponding to geographical zones, that are influenced by atmospheric structures. The combined evaluation of the internal homogeneity and the separation of the clusters obtained using ED and L2 was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Clustering Algorithms Research · Soil Geostatistics and Mapping
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
