Mean, Variance and Asymptotic Property for General Hypergeometric Distribution
Xing-gang Mao, Xiao-yan Xue

TL;DR
This paper derives exact formulas and proofs for the mean and variance of variables in the general hypergeometric distribution, and analyzes their asymptotic behavior, especially when the mean is small.
Contribution
It provides the first complete formulas and rigorous proofs for mean and variance of GHGD variables for any case, extending previous partial results.
Findings
Exact formulas for mean and variance for any GHGD case
Asymptotic behavior shows variance approximates mean when mean is small
When mean < 1, variance can be accurately approximated by the mean
Abstract
General hypergeometric distribution (GHGD) definition: from a finite space containing elements, randomly select totally subsets (each contains elements, ), what is the probability that exactly elements are overlapped exactly times or at least times ( or )? The GHGD described the distribution of random variables and . In our previous results, we obtained the formulas of mathematical expectation and variance for special situations (), and not provided proofs. Here, we completed the exact formulas of mean and variance for and for any situation, and provided strict mathematical proofs. In addition, we give the asymptotic property of the variables. When the mean approaches to 0, the variance fast approaches to the value of mean, and actually, their difference is a higher order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Distribution Estimation and Applications
