Finite-sample and asymptotic analysis of generalization ability with an application to penalized regression

Ning Xu; Jian Hong; Timothy C.G. Fisher

arXiv:1609.03344·stat.ML·September 14, 2016

TL;DR

This paper provides a comprehensive analysis of the generalization ability of extremum estimators, deriving bounds on out-of-sample errors, exploring hyper-parameter tuning, and establishing $L_2$-consistency for penalized regression in high-dimensional settings.

Contribution

It introduces new bounds on prediction errors, links generalization ability to hyper-parameter tuning, and proves $L_2$-consistency of penalized regression estimators in both high-dimensional and low-dimensional cases.

Findings

01

Derived upper bounds on out-of-sample prediction errors.

02

Showed how the number of cross-validation folds $K$ influences the bias-variance trade-off.

03

Proved $L_2$-consistency of penalized regression estimates.

Abstract

In this paper, we study the performance of extremum estimators from the perspective of generalization ability (GA): the ability of a model to predict outcomes in new samples from the same population. By adapting classical concentration inequalities, we derive upper bounds on the empirical out-of-sample prediction errors as a function of the in-sample errors, in-sample data size, heaviness in the tails of the error distribution, and model complexity. We show that the error bounds may be used for tuning key estimation hyper-parameters, such as the number of folds $K$ in cross-validation. We also show how $K$ affects the bias-variance trade-off for cross-validation. We demonstrate that the $L_{2}$-norm difference between penalized and the corresponding un-penalized regression estimates is directly explained by the GA of the estimates and the GA of empirical moment conditions.…
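To make the role of $K$ concrete, the following is a minimal sketch of $K$-fold cross-validation for a penalized regression. It is not the paper's procedure or its error bound. It assumes a single-regressor ridge model without intercept, simulated Gaussian data, and contiguous folds; the helper names `ridge_1d`, `mse`, and `kfold_cv_error` are hypothetical and chosen only for illustration.

```python
import random


def ridge_1d(xs, ys, lam):
    """Closed-form ridge slope for a no-intercept, single-regressor model (assumed setup)."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)


def mse(xs, ys, beta):
    """Mean squared prediction error of the fitted slope on the given sample."""
    return sum((y - beta * x) ** 2 for x, y in zip(xs, ys)) / len(xs)


def kfold_cv_error(xs, ys, lam, k):
    """Average out-of-sample MSE over k contiguous folds (requires k <= len(xs))."""
    n = len(xs)
    bounds = [round(i * n / k) for i in range(k + 1)]
    errors = []
    for i in range(k):
        lo, hi = bounds[i], bounds[i + 1]
        # Fit on everything outside fold i, evaluate on fold i.
        beta = ridge_1d(xs[:lo] + xs[hi:], ys[:lo] + ys[hi:], lam)
        errors.append(mse(xs[lo:hi], ys[lo:hi], beta))
    return sum(errors) / k


# Simulated data (an assumption for illustration): y = 2x + standard normal noise.
rng = random.Random(0)
xs = [rng.gauss(0, 1) for _ in range(200)]
ys = [2.0 * x + rng.gauss(0, 1) for x in xs]

for k in (2, 5, 10):
    print(k, kfold_cv_error(xs, ys, lam=1.0, k=k))
```

A larger `k` leaves more data in each training split and less in each held-out fold, which is the mechanical source of the trade-off the abstract refers to.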
