Goodness-of-Fit Tests for Random Partitions via Symmetric Polynomials
Chao Gao

TL;DR
This paper introduces a novel goodness-of-fit testing method for categorical distributions that leverages symmetric polynomials to account for label permutation invariance, providing asymptotic chi-squared distribution and near-optimal power.
Contribution
It develops a new test statistic based on symmetric polynomials that handles label permutation invariance without matching labels, advancing the methodology for categorical goodness-of-fit testing.
Findings
Asymptotic distribution of the test statistic is chi-squared.
The test achieves near-optimal power under local alternatives.
The method extends to two-sample testing scenarios.
Abstract
We consider goodness-of-fit tests with i.i.d. samples generated from a categorical distribution . For a given , we test the null hypothesis whether for some label permutation . The uncertainty of label permutation implies that the null hypothesis is composite instead of being singular. In this paper, we construct a testing procedure using statistics that are defined as indefinite integrals of some symmetric polynomials. This method is aimed directly at the invariance of the problem, and avoids the need of matching the unknown labels. The asymptotic distribution of the testing statistic is shown to be chi-squared, and its power is proved to be nearly optimal under a local alternative hypothesis. Various degenerate structures of the null hypothesis are carefully analyzed in the paper. A two-sample version of the test is also studied.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Combinatorial Mathematics · Random Matrices and Applications
