Noisy data clusters are hollow
François Léonard (IREQ)

TL;DR
This paper introduces a new perspective in multidimensional statistics where noisy data points form a shell around a topological manifold, changing how likelihoods are computed in noisy measurement scenarios.
Contribution
It proposes the shell manifold concept, which replaces the diffuse cloud of realizations with a shell structure, and analyzes how the distribution of realization-to-manifold distances behaves as the number of dimensions increases.
Findings
Realizations form a shell around the manifold in high dimensions.
The realization-to-shell distance tends toward a normal distribution as the number of dimensions increases.
Likelihood estimation should consider the realization-to-shell distance rather than the realization-to-manifold distance.
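The shell effect in the findings above can be illustrated with a small simulation. This is only a sketch under simplifying assumptions that are not taken from the paper: the manifold is reduced to a single point (the origin), and the noise is independent Gaussian noise with the same standard deviation `sigma` on every axis. The function names `noise_distances` and `shell_summary` are hypothetical.

```python
import math
import random
import statistics


def noise_distances(n_dims, n_realizations, sigma=1.0, seed=0):
    """Distances of noisy realizations from one manifold point (the origin).

    Assumption: i.i.d. Gaussian noise with standard deviation sigma per axis.
    """
    rng = random.Random(seed)
    distances = []
    for _ in range(n_realizations):
        squared = sum(rng.gauss(0.0, sigma) ** 2 for _ in range(n_dims))
        distances.append(math.sqrt(squared))
    return distances


def shell_summary(n_dims, n_realizations=2000, sigma=1.0, seed=0):
    """Return (mean distance, standard deviation, relative shell width)."""
    d = noise_distances(n_dims, n_realizations, sigma, seed)
    mean = statistics.fmean(d)
    spread = statistics.stdev(d)
    return mean, spread, spread / mean


if __name__ == "__main__":
    for n in (2, 20, 200):
        mean, spread, rel = shell_summary(n)
        print(f"N={n:4d}  mean={mean:.2f}  std={spread:.2f}  relative width={rel:.3f}")
```

Under these assumptions the mean distance grows roughly like `sigma * sqrt(N)` while the spread stays close to `sigma / sqrt(2)`, so the relative width of the cloud shrinks as N grows and the realizations sit in a thin shell rather than a filled ball. A likelihood based on the distance to that shell would then peak near the shell radius rather than at zero distance from the manifold.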
Abstract
A new vision in multidimensional statistics is proposed impacting several areas of application. In these applications, a set of noisy measurements characterizing the repeatable response of a process is known as a realization and can be seen as a single point in ℝ^N. The projections of this point on the N axes correspond to the N measurements. The contemporary vision of a diffuse cloud of realizations distributed in ℝ^N is replaced by a cloud in the shape of a shell surrounding a topological manifold. This manifold corresponds to the process's stabilized-response domain observed without the measurement noise. The measurement noise, which accumulates over several dimensions, distances each realization from the manifold. The probability density function (PDF) of the realization-to-manifold distance creates the shell. Considering the central limit theorem as the number…
Taxonomy
Topics: Topological and Geometric Data Analysis · Cell Image Analysis Techniques · Bioinformatics and Genomic Networks
