Quantitative asymptotics of graphical projection pursuit
Elizabeth Meckes

TL;DR
This paper provides quantitative bounds on how close one-dimensional projections of high-dimensional data are to Gaussian distributions, extending classical asymptotic results with explicit probabilistic estimates.
Contribution
It offers explicit bounds on the deviation of projected data distributions from Gaussianity, improving understanding of the rate of convergence in graphical projection pursuit.
Findings
Derived explicit probabilistic bounds for Gaussian approximation of projections.
Provided bounds on the bounded-Lipschitz distance between empirical and Gaussian measures.
Established lower bounds on the waiting time to find non-Gaussian projections.
Abstract
There is a result of Diaconis and Freedman which says that, in a limiting sense, for large collections of high-dimensional data most one-dimensional projections of the data are approximately Gaussian. This paper gives quantitative versions of that result. For a set of deterministic vectors in with and fixed, let be a random point of the sphere and let denote the random measure which puts mass at each of the points . For a fixed bounded Lipschitz test function , a standard Gaussian random variable and a suitable constant, an explicit bound is derived for the quantity . A bound is also given for , where denotes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
