Scale Invariant Monte Carlo under Linear Function Approximation with   Curvature based step-size

Rahul Madhavan; Hemanta Makwana

arXiv:2104.07361·cs.LG·May 31, 2022

Scale Invariant Monte Carlo under Linear Function Approximation with Curvature based step-size

Rahul Madhavan, Hemanta Makwana

PDF

Open Access

TL;DR

This paper introduces a scale-invariant Monte Carlo algorithm with linear function approximation that uses an adaptive curvature-based step-size and heavy-ball momentum, providing convergence guarantees and improved performance in reinforcement learning.

Contribution

It presents a novel scale-invariant Monte Carlo method with adaptive curvature-based step-size and heavy-ball momentum, along with rigorous convergence proofs and empirical validation.

Findings

01

Converges to a scale-invariant solution regardless of feature vector norms

02

Adaptive curvature-based step-size accelerates convergence

03

Method outperforms traditional algorithms in simulations

Abstract

We study the feature-scaled version of the Monte Carlo algorithm with linear function approximation. This algorithm converges to a scale-invariant solution, which is not unduly affected by states having feature vectors with large norms. The usual versions of the MCMC algorithm, obtained by minimizing the least-squares criterion, do not produce solutions that give equal importance to all states irrespective of feature-vector norm -- a requirement that may be critical in many reinforcement learning contexts. To speed up convergence in our algorithm, we introduce an adaptive step-size based on the curvature of the iterate convergence path -- a novelty that may be useful in more general optimization contexts as well. A key contribution of this paper is to prove convergence, in the presence of adaptive curvature based step-size and heavy-ball momentum. We provide rigorous theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Markov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques