Adaptive system optimization using random directions stochastic   approximation

Prashanth L.A.; Shalabh Bhatnagar; Michael Fu; Steve Marcus

arXiv:1502.05577·math.OC·August 11, 2015

Adaptive system optimization using random directions stochastic approximation

Prashanth L.A., Shalabh Bhatnagar, Michael Fu, Steve Marcus

PDF

Open Access 1 Repo

TL;DR

This paper introduces new first- and second-order RDSA algorithms with both continuous and discrete perturbations, providing convergence proofs and demonstrating improved performance over existing methods through theoretical analysis and numerical validation.

Contribution

It presents novel RDSA algorithms with asymmetric Bernoulli perturbations, including a Hessian estimation scheme, and proves their unbiasedness and convergence, advancing simulation optimization techniques.

Findings

01

Unbiased gradient and Hessian estimates established.

02

Asymptotic convergence and normality proved for both schemes.

03

Asymmetric Bernoulli RDSA outperforms 2SPSA in simulations.

Abstract

We present novel algorithms for simulation optimization using random directions stochastic approximation (RDSA). These include first-order (gradient) as well as second-order (Newton) schemes. We incorporate both continuous-valued as well as discrete-valued perturbations into both our algorithms. The former are chosen to be independent and identically distributed (i.i.d.) symmetric, uniformly distributed random variables (r.v.), while the latter are i.i.d., asymmetric, Bernoulli r.v.s. Our Newton algorithm, with a novel Hessian estimation scheme, requires N-dimensional perturbations and three loss measurements per iteration, whereas the simultaneous perturbation Newton search algorithm of [1] requires 2N-dimensional perturbations and four loss measurements per iteration. We prove the unbiasedness of both gradient and Hessian estimates and asymptotic (strong) convergence for both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

prashla/RDSA
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Adaptive Filtering Techniques · Neural Networks and Applications · Simulation Techniques and Applications