A hypothesis-testing perspective on the G-normal distribution theory
Shige Peng, Quan Zhou

TL;DR
This paper investigates the tail behavior of the G-normal distribution using a nonlinear heat equation approach, providing asymptotic results that improve tail probability estimation and reveal vulnerabilities in heteroscedastic hypothesis testing.
Contribution
It introduces a novel analysis of the G-normal distribution's tail behavior via a nonlinear heat equation, impacting hypothesis testing for heteroscedastic data.
Findings
Asymptotic tail probability estimates with high accuracy.
Potential for manipulating heteroscedastic data to falsely attain significance.
Insights into the limitations of current hypothesis testing methods.
Abstract
The G-normal distribution was introduced by Peng [2007] as the limiting distribution in the central limit theorem for sublinear expectation spaces. Equivalently, it can be interpreted as the solution to a stochastic control problem where we have a sequence of random variables, whose variances can be chosen based on all past information. In this note we study the tail behavior of the G-normal distribution through analyzing a nonlinear heat equation. Asymptotic results are provided so that the tail "probabilities" can be easily evaluated with high accuracy. This study also has a significant impact on the hypothesis testing theory for heteroscedastic data; we show that even if the data are generated under the null hypothesis, it is possible to cheat and attain statistical significance by sequentially manipulating the error variances of the observations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic processes and financial applications · Risk and Portfolio Optimization
