Making Simulated Annealing Sample Efficient for Discrete Stochastic   Optimization

Suhail M. Shah

arXiv:2009.06188·math.OC·March 29, 2021

Making Simulated Annealing Sample Efficient for Discrete Stochastic Optimization

Suhail M. Shah

PDF

Open Access

TL;DR

This paper analyzes the regret of simulated annealing (SA) in discrete stochastic optimization, showing it converges efficiently without increased sampling effort in noisy settings and proposing a heuristic for multi-armed bandits with logarithmic regret.

Contribution

It demonstrates that SA's regret depends on Gibbs measure convergence, and introduces modifications to reduce sampling complexity, making SA a competitive exploration heuristic.

Findings

01

SA's regret depends on Gibbs measure convergence rate.

02

SA does not require increased sampling effort with noise for convergence.

03

A SA-inspired heuristic achieves O(log n) regret in multi-armed bandits.

Abstract

We study the regret of simulated annealing (SA) based approaches to solving discrete stochastic optimization problems. The main theoretical conclusion is that the regret of the simulated annealing algorithm, with either noisy or noiseless observations, depends primarily upon the rate of the convergence of the associated Gibbs measure to the optimal states. In contrast to previous works, we show that SA does not need an increased estimation effort (number of \textit{pulls/samples} of the selected \textit{arm/solution} per round for a finite horizon $n$ ) with noisy observations to converge in probability. By simple modifications, we can make the total number of samples per iteration required for convergence (in probability) to scale as $O (n)$ . Additionally, we show that a simulated annealing inspired heuristic can solve the problem of stochastic multi-armed bandits (MAB), by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Algorithms