Convergence rates of least squares regression estimators with heavy-tailed errors
Qiyang Han, Jon A. Wellner

TL;DR
This paper analyzes the convergence rates of least squares estimators in nonparametric regression with heavy-tailed errors, revealing conditions under which the estimator performs optimally or poorly.
Contribution
It establishes the convergence rate of the LSE under heavy-tailed errors with a new multiplier inequality and explores the impact of error moments and covariate independence.
Findings
LSE converges at a rate depending on the entropy exponent and error moments.
Optimal convergence rate matches Gaussian error case when errors have sufficient moments.
Dependence between errors and covariates can cause arbitrarily slow convergence.
Abstract
We study the performance of the Least Squares Estimator (LSE) in a general nonparametric regression model, when the errors are independent of the covariates but may only have a -th moment (). In such a heavy-tailed regression setting, we show that if the model satisfies a standard `entropy condition' with exponent , then the loss of the LSE converges at a rate \begin{align*} \mathcal{O}_{\mathbf{P}}\big(n^{-\frac{1}{2+\alpha}} \vee n^{-\frac{1}{2}+\frac{1}{2p}}\big). \end{align*} Such a rate cannot be improved under the entropy condition alone. This rate quantifies both some positive and negative aspects of the LSE in a heavy-tailed regression setting. On the positive side, as long as the errors have moments, the loss of the LSE converges at the same rate as if the errors are Gaussian. On the negative side, if…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Sparse and Compressive Sensing Techniques
