Improved Active Learning via Dependent Leverage Score Sampling
Atsushi Shimizu, Xiaoou Cheng, Christopher Musco, Jonathan Weare

TL;DR
This paper introduces a new active learning method using dependent leverage score sampling, specifically pivotal sampling, which reduces sample complexity by up to 50% in certain settings and is supported by theoretical guarantees.
Contribution
It proposes a practical pivotal sampling-based active learning algorithm that improves sample efficiency and provides theoretical analysis under weak independence conditions.
Findings
Reduces sample complexity by up to 50% compared to independent sampling.
Matches the sample complexity of independent methods for $d$-dimensional linear functions.
Achieves improved bounds for polynomial regression with $O(d)$ samples.
Abstract
We show how to obtain improved active learning methods in the agnostic (adversarial noise) setting by combining marginal leverage score sampling with non-independent sampling strategies that promote spatial coverage. In particular, we propose an easily implemented method based on the \emph{pivotal sampling algorithm}, which we test on problems motivated by learning-based methods for parametric PDEs and uncertainty quantification. In comparison to independent sampling, our method reduces the number of samples needed to reach a given target accuracy by up to . We support our findings with two theoretical results. First, we show that any non-independent leverage score sampling method that obeys a weak \emph{one-sided independence condition} (which includes pivotal sampling) can actively learn dimensional linear functions with samples, matching…
Peer Reviews
Decision·ICLR 2024 oral
The paper is well-written with clear motivation for the main theoretical results and nice experimental illustrations. The modified pivotal sampling algorithm (Algorithm 2) is appealing and seems to be quite easy to implement.
My main concerns are stated below and are related with the technical novelty of the results of the analysis of Theorems~1.1 and 1.2. The experimental section is sufficient for the demonstration of the approach, yet it could be elaborated to the higher-dimensional problems.
— The paper is extremely well-written, providing a thorough discussion on the advantages and limitations of the authors’ work alongside prior research. — The result on "any" sampling strategy is very interesting, even if it doesn’t improve upon previous theoretical guarantees for the given scenario. — Most sampling-based results obscure significant constants, as noted by the authors regarding the theoretically optimal result due to [Chen and Price 2019], which, in practice, performs poorly. Th
— The empirical evaluation lacks clarity regarding the improvements. The authors should explicitly state the claimed 50% reduction in samples in the technical sections. — The theoretical improvements are solely for polynomial regression, which is essentially linear regression with an infinite number of rows. It would be interesting to explore whether the pivotal sampling method has other implications for different norms. For instance, can this "tournament" style technique be applied to all norm
a. The proposed pivotal sampling method using a binary tree tournament is interesting and innovative. b. The theoretical results are solid. The sample complexity bound for polynomial regression showcases an improvement of a logarithmic factor over the general case. The techniques employed in the analysis have potential implications beyond this specific research. c. The authors provide empirical results to validate the effectiveness of the proposed pivotal sampling. These results clearly indica
a. My primary concern pertains to the computational complexity of the proposed method. The paper lacks both theoretical analysis and empirical results in this regard, leaving an important aspect unaddressed. b. The experimental setup in the paper includes only a single baseline, and it would be beneficial to include additional baseline methods, such as maximum leverage score sampling.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
