Tail bounds for empirically standardized sums

Guenther Walther

arXiv:2109.06371·math.ST·March 22, 2022

Tail bounds for empirically standardized sums

Guenther Walther

PDF

Open Access

TL;DR

This paper investigates tail bounds for sums standardized using empirical methods, showing that proper Studentization can restore exponential tail decay, with applications to various statistical tests and heteroscedastic data analysis.

Contribution

It introduces new tail bounds for empirically standardized sums, including a novel scan statistic for heteroscedastic data with log-concave distributions.

Findings

01

Studentized sums can have sub-Gaussian tail bounds

02

A new scan statistic for heteroscedastic data is proposed

03

Tail bounds are established for different standardization methods

Abstract

Exponential tail bounds for sums play an important role in statistics, but the example of the $t$ -statistic shows that the exponential tail decay may be lost when population parameters need to be estimated from the data. However, it turns out that if Studentizing is accompanied by estimating the location parameter in a suitable way, then the $t$ -statistic regains the exponential tail behavior. Motivated by this example, the paper analyzes other ways of empirically standardizing sums and establishes tail bounds that are sub-Gaussian or even closer to normal for the following settings: Standardization with Studentized contrasts for normal observations, standardization with the log likelihood ratio statistic for observations from an exponential family, and standardization via self-normalization for observations from a symmetric distribution with unknown center of symmetry. The latter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference