Locally sharp goodness-of-fit testing in sup norm for high-dimensional counts
Subhodh Kotekal, Julien Chhor, Chao Gao

TL;DR
This paper investigates the difficulty of testing goodness-of-fit for high-dimensional count data using sup norm, revealing that the local minimax separation rate depends on the decay of category rates and providing sharp bounds and constants.
Contribution
It establishes the local minimax separation rate for sup norm goodness-of-fit testing in high-dimensional counts, with sharp constants in an asymptotic regime.
Findings
The local minimax separation rate depends on the decay behavior of category rates.
An upper bound is achieved via a test based on the sample maximum.
A lower bound is derived by reducing to a homoskedastic null model.
Abstract
We consider testing the goodness-of-fit of a distribution against alternatives separated in sup norm. We study the twin settings of Poisson-generated count data with a large number of categories and high-dimensional multinomials. In previous studies of different separation metrics, it has been found that the local minimax separation rate exhibits substantial heterogeneity and is a complicated function of the null distribution; the rate-optimal test requires careful tailoring to the null. In the setting of sup norm, this remains the case and we establish that the local minimax separation rate is determined by the finer decay behavior of the category rates. The upper bound is obtained by a test involving the sample maximum, and the lower bound argument involves reducing the original heteroskedastic null to an auxiliary homoskedastic null determined by the decay of the rates. Further, in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Machine Learning and Algorithms · Statistical Methods and Inference
