Nonparametric Conditional Density Estimation in a High-Dimensional Regression Setting
Rafael Izbicki, Ann B. Lee

TL;DR
This paper introduces a new nonparametric method for estimating the full conditional density in high-dimensional regression settings, effectively capturing complex, multi-modal distributions.
Contribution
It proposes a kernel eigenfunction expansion approach that adapts to intrinsic data structure, avoiding high-dimensional tensor products and density ratios.
Findings
Effective on images, spectra, and galaxy redshift data.
Achieves adaptive convergence rates based on data intrinsic dimension.
Demonstrates improved density estimation in complex, high-dimensional scenarios.
Abstract
In some applications (e.g., in cosmology and economics), the regression E[Z|x] is not adequate to represent the association between a predictor x and a response Z because of multi-modality and asymmetry of f(z|x); using the full density instead of a single-point estimate can then lead to less bias in subsequent analysis. As of now, there are no effective ways of estimating f(z|x) when x represents high-dimensional, complex data. In this paper, we propose a new nonparametric estimator of f(z|x) that adapts to sparse (low-dimensional) structure in x. By directly expanding f(z|x) in the eigenfunctions of a kernel-based operator, we avoid tensor products in high dimensions as well as ratios of estimated densities. Our basis functions are orthogonal with respect to the underlying data distribution, allowing fast implementation and tuning of parameters. We derive rates of convergence and show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
