A Domain-Oblivious Approach for Learning Concise Representations of Filtered Topological Spaces for Clustering
Yu Qin, Brittany Terese Fasy, Carola Wenk, and Brian Summa

TL;DR
This paper introduces a domain-oblivious hashing framework for persistence diagrams that enables fast, scalable, and accurate topological data analysis through binary codes and GANs, improving clustering and similarity computations.
Contribution
It proposes a novel GAN-based hashing method for persistence diagrams that is domain-oblivious, scalable, and improves computational efficiency in topological data analysis.
Findings
Binary codes better preserve topological similarity.
Scalable to 10,000 diagrams with improved speed.
Comparable or better clustering quality.
Abstract
Persistence diagrams have been widely used to quantify the underlying features of filtered topological spaces in data visualization. In many applications, computing distances between diagrams is essential; however, computing these distances has been challenging due to the computational cost. In this paper, we propose a persistence diagram hashing framework that learns a binary code representation of persistence diagrams, which allows for fast computation of distances. This framework is built upon a generative adversarial network (GAN) with a diagram distance loss function to steer the learning process. Instead of using standard representations, we hash diagrams into binary codes, which have natural advantages in large-scale tasks. The training of this model is domain-oblivious in that it can be computed purely from synthetic, randomly created diagrams. As a consequence, our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Cell Image Analysis Techniques · Advanced Neuroimaging Techniques and Applications
