Making Simulated Annealing Sample Efficient for Discrete Stochastic Optimization
Suhail M. Shah

TL;DR
This paper analyzes the regret of simulated annealing (SA) in discrete stochastic optimization, showing it converges efficiently without increased sampling effort in noisy settings and proposing a heuristic for multi-armed bandits with logarithmic regret.
Contribution
It demonstrates that SA's regret depends on Gibbs measure convergence, and introduces modifications to reduce sampling complexity, making SA a competitive exploration heuristic.
Findings
SA's regret depends on Gibbs measure convergence rate.
SA does not require increased sampling effort with noise for convergence.
A SA-inspired heuristic achieves O(log n) regret in multi-armed bandits.
Abstract
We study the regret of simulated annealing (SA) based approaches to solving discrete stochastic optimization problems. The main theoretical conclusion is that the regret of the simulated annealing algorithm, with either noisy or noiseless observations, depends primarily upon the rate of the convergence of the associated Gibbs measure to the optimal states. In contrast to previous works, we show that SA does not need an increased estimation effort (number of \textit{pulls/samples} of the selected \textit{arm/solution} per round for a finite horizon ) with noisy observations to converge in probability. By simple modifications, we can make the total number of samples per iteration required for convergence (in probability) to scale as . Additionally, we show that a simulated annealing inspired heuristic can solve the problem of stochastic multi-armed bandits (MAB), by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Algorithms
