A Nonparametric Goodness-of-Fit Test for High-Dimensional Generalized Gaussian Distributions via Nearest-Neighbor Graphs
Mehmet S{\i}dd{\i}k \c{C}ad{\i}rc{\i}, Yener \"Unal

TL;DR
This paper proposes a new non-parametric, affine-invariant goodness-of-fit test for high-dimensional generalized Gaussian distributions using nearest-neighbor graphs, addressing challenges in modern high-dimensional data analysis.
Contribution
It introduces a robust, fully non-parametric test based on NN graph topology, with asymptotic validity and improved power over existing methods in high-dimensional settings.
Findings
Accurate Type I error control across dimensions and tail shapes.
Demonstrates robustness and power against heavy- and light-tailed alternatives.
Outperforms energy-distance benchmarks and graphical tests in simulations.
Abstract
The multivariate generalised Gaussian distribution (MGGD) is commonly used to model high-dimensional vectors with non-Gaussian radial behaviour, ranging from sharp-peaked to heavy-tailed profiles. However, because many classical multivariate tests are based on covariance inversion or high-dimensional density estimation, formal goodness-of-fit assessment for MGGD models remains challenging in modern regimes where the dimension is comparable to or exceeds the sample size. We introduce an affine-invariant, fully non-parametric goodness-of-fit procedure based on the nearest neighbour (NN) graph topology and the adapted zero principle. Following robust standardisation, we construct an independent reference sample from the adapted standardised MGGD and measure, on the combined NN graph, the cross-edge count to assess how well the observed and reference point clouds exhibit the mixture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
