Establishing Validity for Distance Functions and Internal Clustering Validity Indices in Correlation Space
Isabella Degen, Zahraa S Abdallah, Kate Robson Brown, Henry W J Reeve

TL;DR
This paper develops a theoretical framework for evaluating clustering validity indices specifically for correlation-based structures in time series data, revealing which indices reliably measure clustering quality for this structure type.
Contribution
It introduces the first validity assessment for correlation pattern clustering, formalizes canonical correlation structures, and provides a structure-specific validation methodology.
Findings
SWC and DBI are valid for correlation patterns
VRC and PBM indices fail for correlation structures
Simple Lp norm distances are valid, correlation-specific functions are not
Abstract
Internal clustering validity indices (ICVIs) assess clustering quality without ground truth labels. Comparative studies consistently find that no single ICVI outperforms others across datasets, leaving practitioners without principled ICVI selection. We argue that inconsistent ICVI performance arises because studies evaluate them based on matching human labels rather than measuring the quality of the discovered structure in the data, using datasets without formally quantifying the structure type and quality. Structure type refers to the mathematical organisation in data that clustering aims to discover. Validity theory requires a theoretical definition of clustering quality, which depends on structure type. We demonstrate this through the first validity assessment of clustering quality measures for correlation patterns, a structure type that arises from clustering time series by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
