Near-Exponential Savings for Mean Estimation with Active Learning

Julian M. Morimoto, Jacob Goldin, and Daniel E. Ho

arXiv:2511.05736 · cs.LG · November 11, 2025

Open Access

TL;DR

This paper introduces PartiBandits, an active learning algorithm that achieves near-exponential savings in label complexity for mean estimation of a multi-class variable by adaptively partitioning data and using UCB strategies.

Contribution

The paper proposes a novel two-stage active learning algorithm that combines UCB and disagreement-based methods, achieving minimax optimal convergence rates for mean estimation.

Findings

01 Achieves near-exponential label savings: the error bound contains a term that decays as $\exp(-cN/\log N)$ in the label budget $N$.

02 Convergence rates are minimax optimal in classical settings.

03 Simulations with electronic health records demonstrate the method's effectiveness.
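
As an informal gloss on the bound stated in the abstract (a reading of the formula, not a result quoted from the paper), the numerator splits into two regimes depending on the Bayes risk $\nu$:

$$
(\mu_{PB} - E[Y])^{2} = \tilde{O}\left(\frac{\nu + \exp(-cN/\log N)}{N}\right) =
\begin{cases}
\tilde{O}\left(\dfrac{\exp(-cN/\log N)}{N}\right), & \nu = 0, \\[1ex]
\tilde{O}\left(\dfrac{\nu}{N}\right), & \nu > 0 \text{ fixed and } N \text{ large}.
\end{cases}
$$

In the second case the exponential term eventually falls below $\nu$, so the numerator is at most $2\nu$ and the bound reduces to the familiar $1/N$ rate scaled by $\nu$. In the first case, where the covariates determine the label, only the near-exponentially decaying term remains.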

Abstract

We study the problem of efficiently estimating the mean of a $k$-class random variable, $Y$, using a limited number of labels, $N$, in settings where the analyst has access to auxiliary information (i.e., covariates) $X$ that may be informative about $Y$. We propose an active learning algorithm ("PartiBandits") to estimate $E[Y]$. The algorithm yields an estimate, $\mu_{PB}$, such that $(\mu_{PB} - E[Y])^{2}$ is $\tilde{O}\left(\frac{\nu + \exp(c \cdot (-N / \log(N)))}{N}\right)$, where $c > 0$ is a constant and $\nu$ is the risk of the Bayes-optimal classifier. PartiBandits is essentially a two-stage algorithm. In the first stage, it learns a partition of the unlabeled data that shrinks the average conditional variance of $Y$. In the second stage, it uses a UCB-style subroutine ("WarmStart-UCB") to request labels…
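
To make the second-stage idea more concrete, here is a minimal Python sketch of UCB-style label allocation over a partition that is already given. It is not the paper's PartiBandits or WarmStart-UCB: the function name `ucb_stratified_mean`, the `warm_start` parameter, the form of the index (cell weight times an optimistic standard deviation, divided by the label count), the bonus constant, and sampling with replacement are all assumptions chosen for illustration. Stage one (learning the partition) is omitted entirely.

```python
import math
import random
from statistics import fmean, pstdev


def ucb_stratified_mean(strata, budget, warm_start=2, seed=0):
    """Estimate the population mean from `budget` label queries.

    `strata` is a list of cells; each cell is a list of hidden labels.
    Hypothetical sketch: the index rule below is an assumption, not the
    paper's WarmStart-UCB subroutine.
    """
    rng = random.Random(seed)
    total = sum(len(cell) for cell in strata)
    weights = [len(cell) / total for cell in strata]

    used = warm_start * len(strata)
    if warm_start < 1 or used > budget:
        raise ValueError("budget must cover at least one warm-start label per cell")

    # Warm start: query a few labels per cell (with replacement, for simplicity)
    # so that every cell has a sample standard deviation.
    seen = [[rng.choice(cell) for _ in range(warm_start)] for cell in strata]

    for t in range(used + 1, budget + 1):
        def index(j):
            n = len(seen[j])
            bonus = math.sqrt(2.0 * math.log(t) / n)  # optimism for rarely sampled cells
            return weights[j] * (pstdev(seen[j]) + bonus) / n

        # Query the cell whose optimistic variance contribution is largest.
        j = max(range(len(strata)), key=index)
        seen[j].append(rng.choice(strata[j]))

    # Stratified estimate: weight each cell's sample mean by its population share.
    estimate = sum(w * fmean(obs) for w, obs in zip(weights, seen))
    counts = [len(obs) for obs in seen]
    return estimate, counts


# Toy population: two cells where the label is constant, one where it is a coin flip.
cells = [[0] * 400, [1] * 400, [0, 1] * 200]
estimate, counts = ucb_stratified_mean(cells, budget=300)
print(estimate, counts)
```

In this toy setup the two pure cells have zero sample variance, so only the exploration bonus draws labels to them, and the mixed cell typically ends up with the largest share of the budget. That mirrors the intuition in the abstract: once a partition shrinks the conditional variance of $Y$ within cells, labels can be concentrated where uncertainty remains.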

Taxonomy

Topics: Machine Learning and Algorithms · Advanced Bandit Algorithms Research · Imbalanced Data Classification Techniques