Topologically Regularized Data Embeddings

Robin Vandaele; Bo Kang; Jefrey Lijffijt; Tijl De Bie; Yvan Saeys

arXiv:2110.09193·cs.LG·March 8, 2022

Topologically Regularized Data Embeddings

Robin Vandaele, Bo Kang, Jefrey Lijffijt, Tijl De Bie, Yvan Saeys

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces new topological loss functions to improve unsupervised data embeddings by incorporating prior topological knowledge, addressing limitations of existing methods, and demonstrating their effectiveness on synthetic and real datasets.

Contribution

The paper proposes a novel set of topological losses that naturally represent simple models and preserve original data structure, enhancing embedding quality.

Findings

01

Improved embedding quality on synthetic data.

02

Effective modeling of high-dimensional single-cell data.

03

Versatile application to graph embedding.

Abstract

Unsupervised feature learning often finds low-dimensional embeddings that capture the structure of complex data. For tasks for which prior expert topological knowledge is available, incorporating this into the learned representation may lead to higher quality embeddings. For example, this may help one to embed the data into a given number of clusters, or to accommodate for noise that prevents one from deriving the distribution of the data over the model directly, which can then be learned more effectively. However, a general tool for integrating different prior topological knowledge into embeddings is lacking. Although differentiable topology layers have been recently developed that can (re)shape embeddings into prespecified topological models, they have two important limitations for representation learning, which we address in this paper. First, the currently suggested topological…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robinvndaele/topembedding
pytorchOfficial

Videos

Topologically Regularized Data Embeddings· slideslive

Taxonomy

TopicsTopological and Geometric Data Analysis · Single-cell and spatial transcriptomics · Cell Image Analysis Techniques