High confidence estimates of the mean of heavy-tailed real random   variables

Olivier Catoni

arXiv:0909.5366·math.ST·September 30, 2009·5 cites

High confidence estimates of the mean of heavy-tailed real random variables

Olivier Catoni

PDF

Open Access

TL;DR

This paper introduces robust estimators for the mean of heavy-tailed distributions using PAC-Bayesian truncation, achieving near-minimax deviations and explicit confidence intervals under certain prior bounds.

Contribution

It develops a new iterative truncation scheme for mean estimation that is nearly minimax optimal for heavy-tailed distributions, with methods to calibrate and adapt to unknown parameters.

Findings

01

The proposed estimators have deviations close to the theoretical minimax bounds.

02

Explicit confidence intervals are derived when variance or kurtosis bounds are known.

03

A new variance estimator with good large deviation properties is introduced.

Abstract

We present new estimators of the mean of a real valued random variable, based on PAC-Bayesian iterative truncation. We analyze the non-asymptotic minimax properties of the deviations of estimators for distributions having either a bounded variance or a bounded kurtosis. It turns out that these minimax deviations are of the same order as the deviations of the empirical mean estimator of a Gaussian distribution. Nevertheless, the empirical mean itself performs poorly at high confidence levels for the worst distribution with a given variance or kurtosis (which turns out to be heavy tailed). To obtain (nearly) minimax deviations in these broad class of distributions, it is necessary to use some more robust estimator, and we describe an iterated truncation scheme whose deviations are close to minimax. In order to calibrate the truncation and obtain explicit confidence intervals, it is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Bandit Algorithms Research