On the conditional distributions of low-dimensional projections from high-dimensional data
Hannes Leeb

TL;DR
This paper demonstrates that in high-dimensional data, the conditional variance of a low-dimensional projection remains approximately constant given another projection, under certain regularity conditions, extending understanding of linear submodels.
Contribution
It provides the first theoretical result showing the approximate constancy of conditional variance in high-dimensional projections, broadening the scope of linear model assumptions.
Findings
Conditional mean is approximately linear in high dimensions.
Conditional variance is approximately constant in high dimensions.
Results hold uniformly for most projection directions.
Abstract
We study the conditional distribution of low-dimensional projections from high-dimensional data, where the conditioning is on other low-dimensional projections. To fix ideas, consider a random d-vector Z that has a Lebesgue density and that is standardized so that and . Moreover, consider two projections defined by unit-vectors and , namely a response and an explanatory variable . It has long been known that the conditional mean of y given x is approximately linear in x\alpha$…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
