Adaptive system optimization using random directions stochastic approximation
Prashanth L.A., Shalabh Bhatnagar, Michael Fu, Steve Marcus

TL;DR
This paper introduces new first- and second-order RDSA algorithms with both continuous and discrete perturbations, providing convergence proofs and demonstrating improved performance over existing methods through theoretical analysis and numerical validation.
Contribution
It presents novel RDSA algorithms with asymmetric Bernoulli perturbations, including a Hessian estimation scheme, and proves their unbiasedness and convergence, advancing simulation optimization techniques.
Findings
Unbiased gradient and Hessian estimates established.
Asymptotic convergence and normality proved for both schemes.
Asymmetric Bernoulli RDSA outperforms 2SPSA in simulations.
Abstract
We present novel algorithms for simulation optimization using random directions stochastic approximation (RDSA). These include first-order (gradient) as well as second-order (Newton) schemes. We incorporate both continuous-valued as well as discrete-valued perturbations into both our algorithms. The former are chosen to be independent and identically distributed (i.i.d.) symmetric, uniformly distributed random variables (r.v.), while the latter are i.i.d., asymmetric, Bernoulli r.v.s. Our Newton algorithm, with a novel Hessian estimation scheme, requires N-dimensional perturbations and three loss measurements per iteration, whereas the simultaneous perturbation Newton search algorithm of [1] requires 2N-dimensional perturbations and four loss measurements per iteration. We prove the unbiasedness of both gradient and Hessian estimates and asymptotic (strong) convergence for both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Adaptive Filtering Techniques · Neural Networks and Applications · Simulation Techniques and Applications
