DeepAtlas: a tool for effective manifold learning
Serena Hughes, Timothy Hamilton, Tom Kolokotrones, Eric J. Deeds

TL;DR
DeepAtlas is a novel algorithm that learns local manifold structures in high-dimensional data, assesses the manifold hypothesis, and enables generative modeling using deep neural networks.
Contribution
It introduces a method to generate local embeddings, train neural networks for mapping, and evaluate the manifold hypothesis in datasets.
Findings
DeepAtlas successfully learns manifold structures in test datasets.
Many real datasets, including single-cell RNA-sequencing, do not conform to the manifold hypothesis.
When data is from a manifold, DeepAtlas provides a generative model and geometric insights.
Abstract
Manifold learning builds on the "manifold hypothesis," which posits that data in high-dimensional datasets are drawn from lower-dimensional manifolds. Current tools generate global embeddings of data, rather than the local maps used to define manifolds mathematically. These tools also cannot assess whether the manifold hypothesis holds true for a dataset. Here, we describe DeepAtlas, an algorithm that generates lower-dimensional representations of the data's local neighborhoods, then trains deep neural networks that map between these local embeddings and the original data. Topological distortion is used to determine whether a dataset is drawn from a manifold and, if so, its dimensionality. Application to test datasets indicates that DeepAtlas can successfully learn manifold structures. Interestingly, many real datasets, including single-cell RNA-sequencing, do not conform to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
