Scaling Effects and Uncertainty Quantification in Neural Actor Critic Algorithms
Nikos Georgoudios, Konstantinos Spiliopoulos, Justin Sirignano

TL;DR
This paper analyzes how different scaling schemes in neural Actor Critic algorithms affect convergence and uncertainty quantification, providing guidelines for hyperparameter selection to improve statistical robustness.
Contribution
It introduces a comprehensive statistical analysis of neural Actor Critic algorithms under various scaling regimes, extending previous convergence results to uncertainty quantification.
Findings
Variance decays as a power of network width with an exponent depending on the scaling parameter
Faster convergence observed for certain scaling choices in numerical experiments
Guidelines for hyperparameter tuning based on theoretical analysis
Abstract
We investigate the neural Actor Critic algorithm using shallow neural networks for both the Actor and Critic models. The focus of this work is twofold: first, to compare the convergence properties of the network outputs under various scaling schemes as the network width and the number of training steps tend to infinity; and second, to provide precise control of the approximation error associated with each scaling regime. Previous work has shown convergence to ordinary differential equations with random initial conditions under inverse square root scaling in the network width. In this work, we shift the focus from convergence speed alone to a more comprehensive statistical characterization of the algorithm's output, with the goal of quantifying uncertainty in neural Actor Critic methods. Specifically, we study a general inverse polynomial scaling in the network width, with an exponent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
