PCA-Kernel Estimation
G\'erard Biau (LSTA, DMA), Andr\'e Mas (I3M)

TL;DR
This paper analyzes the asymptotic properties of kernel estimation methods applied after PCA-based dimension reduction, addressing the dependence issues introduced by data-driven projections in high-dimensional settings.
Contribution
It provides theoretical results establishing asymptotic equivalences between kernel quantities based on empirical and theoretical projectors, with detailed analysis of kernel regression.
Findings
Asymptotic equivalence between empirical and theoretical kernel quantities
Analysis of dependence introduced by PCA projection
Insights into kernel regression after PCA reduction
Abstract
Many statistical estimation techniques for high-dimensional or functional data are based on a preliminary dimension reduction step, which consists in projecting the sample onto the first eigenvectors of the Principal Component Analysis (PCA) associated with the empirical projector . Classical nonparametric inference methods such as kernel density estimation or kernel regression analysis are then performed in the (usually small) -dimensional space. However, the mathematical analysis of this data-driven dimension reduction scheme raises technical problems, due to the fact that the random variables of the projected sample are no more independent. As a reference for further studies, we offer in this paper several results showing the asymptotic equivalencies between important kernel-related quantities based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
