Computing the Expected Value and Variance of Geometric Measures
Frank Staals, Constantinos Tsirogiannis

TL;DR
This paper develops efficient algorithms to exactly compute and approximate the mean and variance of various geometric measures on random subsets of points in high-dimensional space, with applications in ecology.
Contribution
It introduces novel exact and approximation algorithms for statistical analysis of geometric measures under two common random subset distributions.
Findings
Algorithms for bounding box volume, convex hull volume, MPD, centroid distance, and diameter.
Major speedups over existing methods.
Effective approximation for mean pairwise distance.
Abstract
Let P be a set of points in R^d, and let M be a function that maps any subset of P to a positive real number. We examine the problem of computing the exact mean and variance of M when a subset of points in P is selected according to a well-defined random distribution. We consider two distributions; in the first distribution (which we call the Bernoulli distribution), each point p in P is included in the random subset independently, with probability pi(p). In the second distribution (the fixed-size distribution), a subset of exactly s points is selected uniformly at random among all possible subsets of s points in P. This problem is a crucial part of modern ecological analyses; each point in P represents a species in d-dimensional trait space, and the goal is to compute the statistics of a geometric measure on this trait space, when subsets of species are selected under random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEcology and Vegetation Dynamics Studies · Wildlife-Road Interactions and Conservation · Land Use and Ecosystem Services
