TL;DR
This paper investigates active sequential mean estimation, proposing a new analysis and showing how query probabilities converge under certain learning strategies, supported by simulations.
Contribution
It provides a non-asymptotic, data-dependent confidence bound analysis and insights into query probability convergence in active learning.
Findings
The smallest confidence width occurs when the constant probability dominates.
The confidence interval bound is data-dependent and non-asymptotic.
Query probability converges to the maximum constraint when using no-regret learning.
Abstract
In this work, we revisit the problem of active sequential prediction-powered mean estimation, where at each round one must decide the query probability of the ground-truth label upon observing the covariates of a sample. Furthermore, if the label is not queried, the prediction from a machine learning model is used instead. Prior work proposed an elegant scheme that determines the query probability by combining an uncertainty-based suggestion with a constant probability that encodes a soft constraint on the query probability. We explored different values of the mixing parameter and observed an intriguing empirical pattern: the smallest confidence width tends to occur when the weight on the constant probability is close to one, thereby reducing the influence of the uncertainty-based component. Motivated by this observation, we develop a non-asymptotic analysis of the estimator and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
