Adaptive Batch Sizes for Active Learning A Probabilistic Numerics Approach
Masaki Adachi, Satoshi Hayakawa, Martin J{\o}rgensen, Xingchen Wan, Vu, Nguyen, Harald Oberhauser, Michael A. Osborne

TL;DR
This paper introduces a probabilistic numerics framework for adaptive batch sizing in active learning, optimizing the trade-off between cost and speed by automatically tuning batch sizes based on integration error, improving efficiency in Bayesian learning and optimization.
Contribution
The paper presents a novel probabilistic numerics approach that adaptively adjusts batch sizes during active learning, eliminating the need for exhaustive batch size searches and enhancing efficiency.
Findings
Significantly improves learning efficiency in active learning.
Automatically adapts batch sizes based on integration error.
Enhances flexibility in Bayesian optimization applications.
Abstract
Active learning parallelization is widely used, but typically relies on fixing the batch size throughout experimentation. This fixed approach is inefficient because of a dynamic trade-off between cost and speed -- larger batches are more costly, smaller batches lead to slower wall-clock run-times -- and the trade-off may change over the run (larger batches are often preferable earlier). To address this trade-off, we propose a novel Probabilistic Numerics framework that adaptively changes batch sizes. By framing batch selection as a quadrature task, our integration-error-aware algorithm facilitates the automatic tuning of batch sizes to meet predefined quadrature precision objectives, akin to how typical optimizers terminate based on convergence thresholds. This approach obviates the necessity for exhaustive searches across all potential batch sizes. We also extend this to scenarios with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Reservoir Engineering and Simulation Methods · Advanced Statistical Process Monitoring
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
