Catoni-style confidence sequences for heavy-tailed mean estimation
Hongjian Wang, Aaditya Ramdas

TL;DR
This paper introduces three methods for constructing confidence sequences for the mean of heavy-tailed distributions, with the Catoni-style approach achieving near-optimal performance under minimal variance assumptions.
Contribution
It presents novel confidence sequence techniques that work under only variance bounds, extending applicability to heavy-tailed data unlike previous methods.
Findings
Catoni-style confidence sequence performs well in practice.
Achieves the lower bound of $rac{ oot frac{ ext{log log t}}{t}}$.
Extends methods to data with infinite variance but finite $p$-th moments.
Abstract
A confidence sequence (CS) is a sequence of confidence intervals that is valid at arbitrary data-dependent stopping times. These are useful in applications like A/B testing, multi-armed bandits, off-policy evaluation, election auditing, etc. We present three approaches to constructing a confidence sequence for the population mean, under the minimal assumption that only an upper bound on the variance is known. While previous works rely on light-tail assumptions like boundedness or subGaussianity (under which all moments of a distribution exist), the confidence sequences in our work are able to handle data from a wide range of heavy-tailed distributions. The best among our three methods -- the Catoni-style confidence sequence -- performs remarkably well in practice, essentially matching the state-of-the-art methods for -subGaussian data, and provably attains the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Distributed Sensor Networks and Detection Algorithms
