Nearly Optimal Sample Complexity for Learning with Label Proportions

Robert Busa-Fekete; Travis Dick; Claudio Gentile; Haim Kaplan; Tomer Koren; Uri Stemmer

arXiv:2505.05355 · cs.LG · June 2, 2025

TL;DR

This paper studies Learning from Label Proportions, establishing nearly optimal sample complexity bounds and proposing algorithms that outperform existing methods both theoretically and empirically.

Contribution

The paper provides the first nearly optimal sample complexity analysis for LLP under square loss and introduces variants of Empirical Risk Minimization and Stochastic Gradient Descent, combined with variance reduction, that show better empirical performance.

Findings

01. Sample complexity is essentially optimal and improves on previous bounds.

02. Algorithms achieve better accuracy with fewer samples in experiments.

03. Theoretical results show improved dependence on bag size.

Abstract

We investigate Learning from Label Proportions (LLP), a partial information setting where examples in a training set are grouped into bags, and only aggregate label values in each bag are available. Despite the partial observability, the goal is still to achieve small regret at the level of individual examples. We give results on the sample complexity of LLP under square loss, showing that our sample complexity is essentially optimal. From an algorithmic viewpoint, we rely on carefully designed variants of Empirical Risk Minimization and Stochastic Gradient Descent algorithms, combined with ad hoc variance reduction techniques. On the one hand, our theoretical results improve in important ways on the existing literature on LLP, specifically in the way the sample complexity depends on the bag size. On the other hand, we validate our algorithmic solutions on several datasets, demonstrating…
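
To make the setting concrete, the following is a minimal Python sketch of how bag-level supervision can be turned into per-example training targets under square loss. It is not the paper's algorithm. It uses a standard debiasing idea, assuming bags of equal size k are formed from i.i.d. examples: every example in a bag with label proportion α receives the surrogate label kα − (k − 1)p, where p is the overall label mean. The function names (`bag_proportions`, `surrogate_labels`) and the choice to estimate p from the bag proportions are assumptions made for illustration.

```python
import numpy as np


def bag_proportions(labels, bag_size):
    """Group consecutive labels into bags and return each bag's mean label.

    Only these per-bag means are visible to the learner in LLP.
    Trailing examples that do not fill a whole bag are dropped.
    """
    labels = np.asarray(labels, dtype=float)
    n_bags = len(labels) // bag_size
    trimmed = labels[: n_bags * bag_size]
    return trimmed.reshape(n_bags, bag_size).mean(axis=1)


def surrogate_labels(proportions, bag_size, label_mean):
    """Return one surrogate label per example: k * alpha - (k - 1) * p.

    Assumption (illustrative, not from the paper): examples in a bag are
    i.i.d., so the other k - 1 labels in the bag contribute (k - 1) * p
    in expectation and can be subtracted off.
    """
    proportions = np.asarray(proportions, dtype=float)
    per_bag = bag_size * proportions - (bag_size - 1) * label_mean
    # Every example in a bag shares that bag's surrogate label.
    return np.repeat(per_bag, bag_size)


# Hypothetical toy data: six binary labels split into two bags of size 3.
labels = np.array([1, 0, 1, 1, 0, 0])
proportions = bag_proportions(labels, bag_size=3)

# The mean of the bag proportions estimates the overall label mean p.
label_mean = proportions.mean()
targets = surrogate_labels(proportions, bag_size=3, label_mean=label_mean)
print(proportions)
print(targets)
```

Under the i.i.d. assumption the surrogate label has the same conditional mean as the hidden individual label, so it can be plugged into an ordinary square-loss learner. Its variance grows with the bag size, which gives some intuition for why variance reduction and the dependence on bag size are central concerns in this setting.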

Videos

Nearly Optimal Sample Complexity for Learning with Label Proportions (SlidesLive)

Taxonomy

Topics: Machine Learning and Data Classification · Text and Document Classification Technologies · Machine Learning and Algorithms

Methods: High-Order Consensuses · Sparse Evolutionary Training