Catoni-style confidence sequences for heavy-tailed mean estimation

Hongjian Wang; Aaditya Ramdas

arXiv:2202.01250·math.ST·February 9, 2024

Catoni-style confidence sequences for heavy-tailed mean estimation

Hongjian Wang, Aaditya Ramdas

PDF

Open Access

TL;DR

This paper introduces three methods for constructing confidence sequences for the mean of heavy-tailed distributions, with the Catoni-style approach achieving near-optimal performance under minimal variance assumptions.

Contribution

It presents novel confidence sequence techniques that work under only variance bounds, extending applicability to heavy-tailed data unlike previous methods.

Findings

01

Catoni-style confidence sequence performs well in practice.

02

Achieves the lower bound of $rac{ oot frac{ ext{log log t}}{t}}$.

03

Extends methods to data with infinite variance but finite $p$-th moments.

Abstract

A confidence sequence (CS) is a sequence of confidence intervals that is valid at arbitrary data-dependent stopping times. These are useful in applications like A/B testing, multi-armed bandits, off-policy evaluation, election auditing, etc. We present three approaches to constructing a confidence sequence for the population mean, under the minimal assumption that only an upper bound $σ^{2}$ on the variance is known. While previous works rely on light-tail assumptions like boundedness or subGaussianity (under which all moments of a distribution exist), the confidence sequences in our work are able to handle data from a wide range of heavy-tailed distributions. The best among our three methods -- the Catoni-style confidence sequence -- performs remarkably well in practice, essentially matching the state-of-the-art methods for $σ^{2}$ -subGaussian data, and provably attains the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Distributed Sensor Networks and Detection Algorithms