Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization

Foo Hui-Mean; Yuan-chin Ivan Chang

arXiv:2601.00615·stat.CO·January 5, 2026

Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization

Foo Hui-Mean, Yuan-chin Ivan Chang

PDF

Open Access

TL;DR

This paper introduces ALMAB-DC, a scalable, modular framework combining active learning, multi-armed bandits, and distributed computing to efficiently solve complex black-box optimization problems with theoretical guarantees and superior empirical performance.

Contribution

The paper presents ALMAB-DC, a novel unified framework that integrates multiple techniques for scalable black-box optimization, with theoretical analysis and extensive empirical validation.

Findings

01

Outperforms state-of-the-art black-box optimizers on benchmarks.

02

Provides theoretical regret bounds for UCB and Thompson sampling variants.

03

Demonstrates scalability and efficiency in high-dimensional, resource-intensive tasks.

Abstract

Modern optimization problems in scientific and engineering domains often rely on expensive black-box evaluations, such as those arising in physical simulations or deep learning pipelines, where gradient information is unavailable or unreliable. In these settings, conventional optimization methods quickly become impractical due to prohibitive computational costs and poor scalability. We propose ALMAB-DC, a unified and modular framework for scalable black-box optimization that integrates active learning, multi-armed bandits, and distributed computing, with optional GPU acceleration. The framework leverages surrogate modeling and information-theoretic acquisition functions to guide informative sample selection, while bandit-based controllers dynamically allocate computational resources across candidate evaluations in a statistically principled manner. These decisions are executed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference