Suboptimality of Constrained Least Squares and Improvements via Non-Linear Predictors
Tomas Va\v{s}kevi\v{c}ius, Nikita Zhivotovskiy

TL;DR
This paper demonstrates that constrained least squares can be suboptimal in bounded settings and shows that non-linear predictors can achieve optimal rates without distributional assumptions.
Contribution
It proves the suboptimality of constrained least squares in bounded distributions and highlights the effectiveness of non-linear predictors for optimal risk rates.
Findings
Constrained least squares does not always attain the $O(d/n)$ rate.
Constructed bounded distribution causes $ ilde{ ext{Omega}}(d^{3/2}/n)$ excess risk.
Non-linear predictors can achieve the optimal $O(d/n)$ rate without distribution assumptions.
Abstract
We study the problem of predicting as well as the best linear predictor in a bounded Euclidean ball with respect to the squared loss. When only boundedness of the data generating distribution is assumed, we establish that the least squares estimator constrained to a bounded Euclidean ball does not attain the classical excess risk rate, where is the dimension of the covariates and is the number of samples. In particular, we construct a bounded distribution such that the constrained least squares estimator incurs an excess risk of order hence refuting a recent conjecture of Ohad Shamir [JMLR 2015]. In contrast, we observe that non-linear predictors can achieve the optimal rate with no assumptions on the distribution of the covariates. We discuss additional distributional assumptions sufficient to guarantee an excess risk rate for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Statistical Methods and Models
