An Exponential Averaging Process with Strong Convergence Properties
Frederik K\"ohne, Anton Schiela

TL;DR
This paper introduces a modified exponential averaging method called p-EMA with improved stochastic convergence properties, applicable to noisy observations in random dynamical systems and adaptive SGD step size control.
Contribution
The paper proposes p-EMA, an adaptation of EMA with weights decreasing to zero, and provides convergence guarantees under mild autocorrelation assumptions.
Findings
p-EMA achieves strong stochastic convergence.
Applicable to noisy observations in dynamical systems.
Implications for adaptive SGD step size control.
Abstract
Averaging, or smoothing, is a fundamental approach to obtain stable, de-noised estimates from noisy observations. In certain scenarios, observations made along trajectories of random dynamical systems are of particular interest. One popular smoothing technique for such a scenario is exponential moving averaging (EMA), which assigns observations a weight that decreases exponentially in their age, thus giving younger observations a larger weight. However, EMA fails to enjoy strong stochastic convergence properties, which stems from the fact that the weight assigned to the youngest observation is constant over time, preventing the noise in the averaged quantity from decreasing to zero. In this work, we consider an adaptation to EMA, which we call -EMA, where the weights assigned to the last observations decrease to zero at a subharmonic rate. We provide stochastic convergence guarantees…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic processes and financial applications · Gaussian Processes and Bayesian Inference
