Estimating the Maximum Expected Value: An Analysis of (Nested) Cross   Validation and the Maximum Sample Average

Hado van Hasselt

arXiv:1302.7175·stat.ML·March 4, 2013·19 cites

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

Hado van Hasselt

PDF

Open Access

TL;DR

This paper analyzes the bias and variance of estimators for the maximum expected value, focusing on the maximum sample average and cross validation, highlighting their limitations and problem-dependent performance.

Contribution

It provides a theoretical analysis of the bias, variance, and consistency of common estimators for maximum expected value, including bounds and insights into their trade-offs.

Findings

01

No unbiased estimator exists for the maximum expected value.

02

Cross validation can reduce variance but may introduce large bias.

03

Estimator performance is highly problem-dependent.

Abstract

We investigate the accuracy of the two most common estimators for the maximum expected value of a general set of random variables: a generalization of the maximum sample average, and cross validation. No unbiased estimator exists and we show that it is non-trivial to select a good estimator without knowledge about the distributions of the random variables. We investigate and bound the bias and variance of the aforementioned estimators and prove consistency. The variance of cross validation can be significantly reduced, but not without risking a large bias. The bias and variance of different variants of cross validation are shown to be very problem-dependent, and a wrong choice can lead to very inaccurate estimates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Risk and Portfolio Optimization · Bayesian Modeling and Causal Inference