The Sample Complexity of Multi-Distribution Learning for VC Classes
Pranjal Awasthi, Nika Haghtalab, Eric Zhao

TL;DR
This paper investigates the sample complexity of learning VC classes across multiple distributions, highlighting gaps between existing upper and lower bounds and discussing recent progress and fundamental hurdles in the field.
Contribution
It analyzes the bounds for multi-distribution learning of VC classes, clarifies the gap between known upper and lower bounds, and discusses recent advancements and challenges.
Findings
Upper bound on sample complexity: $O(rac{1}{\
Lower bound: $\\Omega(rac{1}{\
Discussion of fundamental hurdles in game dynamics for statistical learning.
Abstract
Multi-distribution learning is a natural generalization of PAC learning to settings with multiple data distributions. There remains a significant gap between the known upper and lower bounds for PAC-learnable classes. In particular, though we understand the sample complexity of learning a VC dimension d class on distributions to be , the best lower bound is . We discuss recent progress on this problem and some hurdles that are fundamental to the use of game dynamics in statistical learning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
