How Does Independence Help Generalization? Sample Complexity of ERM on Product Distributions
Tao Lin

TL;DR
This note studies the sample complexity of Empirical Risk Minimization (ERM) on product distributions. It shows that ERM still needs an exponential number of samples even though product distributions are structurally simpler than correlated ones, which suggests that efficient learning in this setting requires algorithms other than ERM.
Contribution
The note characterizes the sample complexity of ERM in a general learning problem on product distributions and shows that it remains exponential, in contrast to the non-ERM algorithm of [GHTZ21], whose sample complexity is polynomial in the input dimension.
Findings
ERM requires an exponential number of samples on product distributions
Product distributions alone do not simplify ERM's sample complexity
Specialized algorithms are necessary for efficient learning on product distributions
Abstract
While many classical notions of learnability (e.g., PAC learnability) are distribution-free, utilizing the specific structures of an input distribution may improve learning performance. For example, a product distribution on a multi-dimensional input space has a much simpler structure than a correlated distribution. A recent paper [GHTZ21] shows that the sample complexity of a general learning problem on product distributions is polynomial in the input dimension, which is exponentially smaller than that on correlated distributions. However, the learning algorithm they use is not the standard Empirical Risk Minimization (ERM) algorithm. In this note, we characterize the sample complexity of ERM in a general learning problem on product distributions. We show that, even though product distributions are simpler than correlated distributions, ERM still needs an exponential number of samples…
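For readers unfamiliar with the terminology, the following is a minimal sketch of what "ERM on a product distribution" means: draw samples whose coordinates are independent, then return the hypothesis with the smallest average loss on those samples. It is not the paper's construction. The names `sample_product` and `erm`, the Bernoulli marginals, and the toy loss are all hypothetical and chosen only for illustration.

```python
import random
from typing import Callable, List, Sequence, Tuple, TypeVar

Point = Tuple[int, ...]
H = TypeVar("H")


def sample_product(marginals: Sequence[float], n: int, rng: random.Random) -> List[Point]:
    """Draw n points from a product distribution on {0,1}^d.

    Coordinate i is Bernoulli(marginals[i]) and is drawn independently
    of every other coordinate (illustrative assumption).
    """
    return [tuple(int(rng.random() < p) for p in marginals) for _ in range(n)]


def erm(hypotheses: Sequence[H], loss: Callable[[H, Point], float], samples: Sequence[Point]) -> H:
    """Return the hypothesis that minimizes the average loss on the samples."""
    if not samples:
        raise ValueError("ERM needs at least one sample")

    def empirical_risk(h: H) -> float:
        return sum(loss(h, x) for x in samples) / len(samples)

    return min(hypotheses, key=empirical_risk)


if __name__ == "__main__":
    rng = random.Random(0)
    data = sample_product([0.1, 0.9, 0.5], 2000, rng)
    # Toy problem: a hypothesis is a coordinate index, and its loss on x is 1 - x[index].
    best = erm([0, 1, 2], lambda h, x: 1 - x[h], data)
    print("ERM picked coordinate", best)
```

In this toy problem the hypothesis class has only three elements, so a modest number of samples suffices. The note's result concerns how the number of samples ERM needs grows in a general learning problem.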
Taxonomy
Topics: Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
