Measuring inter-cluster similarities with Alpha Shape TRIangulation in loCal Subspaces (ASTRICS) facilitates visualization and clustering of high-dimensional data
Joshua M. Scurll

TL;DR
The paper introduces ASTRICS, a parameter-free similarity measure for high-dimensional data clusters, enabling automatic graph construction for improved clustering and visualization without user-defined thresholds.
Contribution
ASTRICS provides a novel, threshold-free similarity measure based on local dimensionality reduction and alpha shape triangulation, facilitating automatic graph-based clustering and visualization of high-dimensional data.
Findings
ASTRICS effectively measures inter-cluster similarity without user-defined parameters.
The method improves clustering accuracy and visualization clarity.
ASTRICS enables automatic resolution selection in high-dimensional data analysis.
Abstract
Clustering and visualizing high-dimensional (HD) data are important tasks in a variety of fields. For example, in bioinformatics, they are crucial for analyses of single-cell data such as mass cytometry (CyTOF) data. Some of the most effective algorithms for clustering HD data are based on representing the data by nodes in a graph, with edges connecting neighbouring nodes according to some measure of similarity or distance. However, users of graph-based algorithms are typically faced with the critical but challenging task of choosing the value of an input parameter that sets the size of neighbourhoods in the graph, e.g. the number of nearest neighbours to which to connect each node or a threshold distance for connecting nodes. The burden on the user could be alleviated by a measure of inter-node similarity that can have value 0 for dissimilar nodes without requiring any user-defined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Advanced Clustering Algorithms Research · Complex Network Analysis Techniques
