Uniform Convergence Beyond Glivenko-Cantelli
Tanmay Devale, Pramith Devulapalli, Steve Hanneke

TL;DR
This paper extends the concept of uniform convergence beyond the classical empirical mean estimator, introducing UME-learnability to characterize when collections of distributions allow for uniform mean estimation by any estimator, and proves properties of such collections.
Contribution
It introduces UME-learnability as a new framework for uniform mean estimation beyond empirical means and characterizes its conditions, including separability and closure properties.
Findings
Separability of mean vectors is sufficient for UME-learnability.
Non-separable mean vectors can still be UME-learnable with different techniques.
Countable unions of UME-learnable collections are UME-learnable.
Abstract
We characterize conditions under which collections of distributions on admit uniform estimation of their mean. Prior work from Vapnik and Chervonenkis (1971) has focused on uniform convergence using the empirical mean estimator, leading to the principle known as Glivenko-Cantelli. We extend this framework by moving beyond the empirical mean estimator and introducing Uniform Mean Estimability, also called UME-learnability, which captures when a collection permits uniform mean estimation by any arbitrary estimator. We work on the space created by the mean vectors of the collection of distributions. For each distribution, the mean vector records the expected value in each coordinate. We show that separability of the mean vectors is a sufficient condition for UME-learnability. However, we show that separability of the mean vectors is not necessary for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques
