Scaling Effects and Uncertainty Quantification in Neural Actor Critic Algorithms

Nikos Georgoudios; Konstantinos Spiliopoulos; Justin Sirignano

arXiv:2601.17954·cs.LG·January 27, 2026

Scaling Effects and Uncertainty Quantification in Neural Actor Critic Algorithms

Nikos Georgoudios, Konstantinos Spiliopoulos, Justin Sirignano

PDF

Open Access

TL;DR

This paper analyzes how different scaling schemes in neural Actor Critic algorithms affect convergence and uncertainty quantification, providing guidelines for hyperparameter selection to improve statistical robustness.

Contribution

It introduces a comprehensive statistical analysis of neural Actor Critic algorithms under various scaling regimes, extending previous convergence results to uncertainty quantification.

Findings

01

Variance decays as a power of network width with an exponent depending on the scaling parameter

02

Faster convergence observed for certain scaling choices in numerical experiments

03

Guidelines for hyperparameter tuning based on theoretical analysis

Abstract

We investigate the neural Actor Critic algorithm using shallow neural networks for both the Actor and Critic models. The focus of this work is twofold: first, to compare the convergence properties of the network outputs under various scaling schemes as the network width and the number of training steps tend to infinity; and second, to provide precise control of the approximation error associated with each scaling regime. Previous work has shown convergence to ordinary differential equations with random initial conditions under inverse square root scaling in the network width. In this work, we shift the focus from convergence speed alone to a more comprehensive statistical characterization of the algorithm's output, with the goal of quantifying uncertainty in neural Actor Critic methods. Specifically, we study a general inverse polynomial scaling in the network width, with an exponent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis