Statistical analysis of Mapper for stochastic and multivariate filters
Mathieu Carri\`ere, Bertrand Michel

TL;DR
This paper extends the statistical analysis of Mapper to multivariate and metric space filters, providing risk bounds and applications in data analysis, especially for finite metric spaces and estimated filters.
Contribution
It introduces a modified Mapper construction with risk bounds for Reeb space estimation applicable to multivariate and metric space filters, including data-driven functions.
Findings
Provides risk bounds for Mapper in estimating Reeb spaces
Applicable to data-driven filters like PCA eigenfunctions
Demonstrates relevance through numerical experiments
Abstract
Reeb spaces, as well as their discretized versions called Mappers, are common descriptors used in Topological Data Analysis, with plenty of applications in various fields of science, such as computational biology and data visualization, among others. The stability and quantification of the rate of convergence of the Mapper to the Reeb space has been studied a lot in recent works [BBMW19, CO17, CMO18, MW16], focusing on the case where a scalar-valued filter is used for the computation of Mapper. On the other hand, much less is known in the multivariate case, when the codomain of the filter is , and in the general case, when it is a general metric space , instead of . The few results that are available in this setting [DMW17, MW16] can only handle continuous topological spaces and cannot be used as is for finite metric spaces representing data, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Geochemistry and Geologic Mapping
