Multiply Accelerated Value Iteration for Non-Symmetric Affine Fixed Point Problems and application to Markov Decision Processes
Marianne Akian, St\'ephane Gaubert, Zheng Qu, Omar Saadi

TL;DR
This paper introduces accelerated algorithms for affine fixed point problems with non-symmetric matrices, applicable to Markov decision processes, achieving faster convergence rates through spectral analysis and higher-order methods.
Contribution
It develops a modified Nesterov gradient algorithm for non-self-adjoint matrices and introduces a higher-order accelerated method with improved convergence under spectral conditions.
Findings
The modified algorithm converges with an accelerated asymptotic rate for certain spectra.
The $d$th-order method achieves multiply accelerated rates under stricter spectral conditions.
Numerical experiments demonstrate the effectiveness of these accelerated schemes in MDP applications.
Abstract
We analyze a modified version of Nesterov accelerated gradient algorithm, which applies to affine fixed point problems with non self-adjoint matrices, such as the ones appearing in the theory of Markov decision processes with discounted or mean payoff criteria. We characterize the spectra of matrices for which this algorithm does converge with an accelerated asymptotic rate. We also introduce a th-order algorithm, and show that it yields a multiply accelerated rate under more demanding conditions on the spectrum. We subsequently apply these methods to develop accelerated schemes for non-linear fixed point problems arising from Markov decision processes. This is illustrated by numerical experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Optimization and Variational Analysis
