Optimal estimation of variance in nonparametric regression with random design
Yandi Shen, Chao Gao, Daniela Witten, and Fang Han

TL;DR
This paper derives the optimal rates for estimating variance functions in heteroscedastic nonparametric regression with random design, extending previous fixed design results and proposing a U-statistic-based estimator.
Contribution
It establishes the minimax rates for variance estimation under both local and global risks in a heteroscedastic nonparametric regression with random design, extending fixed design results.
Findings
Minimax rate of variance estimation is n^{-rac{8etaeta}{4etaeta + 2eta + eta}} ext{ or } n^{-rac{2eta}{2eta+1}}.
Special case of constant variance yields a rate of n^{-8eta/(4eta+1)} ext{ or } n^{-1}.
Proposes a U-statistic-based local polynomial estimator that achieves the minimax rate.
Abstract
Consider the heteroscedastic nonparametric regression model with random design \begin{align*} Y_i = f(X_i) + V^{1/2}(X_i)\varepsilon_i, \quad i=1,2,\ldots,n, \end{align*} with and - and -H\"older smooth, respectively. We show that the minimax rate of estimating under both local and global squared risks is of the order \begin{align*} n^{-\frac{8\alpha\beta}{4\alpha\beta + 2\alpha + \beta}} \vee n^{-\frac{2\beta}{2\beta+1}}, \end{align*} where for any two real numbers . This result extends the fixed design rate derived in Wang et al. [2008] in a non-trivial manner, as indicated by the appearances of both and in the first term. In the special case of constant variance, we show that the minimax rate is for variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Statistical Methods and Bayesian Inference
