Fixed Point Theory Analysis of a Lambda Policy Iteration with Randomization for the \'Ciri\'c Contraction Operator

Abdelkader Belhenniche; Roman Chertovskih

arXiv:2405.07824·math.OC·October 16, 2025

Fixed Point Theory Analysis of a Lambda Policy Iteration with Randomization for the \'Ciri\'c Contraction Operator

Abdelkader Belhenniche, Roman Chertovskih

PDF

Open Access

TL;DR

This paper uses fixed point theory to analyze a randomized Lambda policy iteration algorithm for weak contraction mappings, broadening the scope beyond traditional strong contractions in reinforcement learning.

Contribution

It extends fixed point analysis to weak contraction mappings in policy iteration, providing convergence conditions in infinite-dimensional spaces.

Findings

01

Convergence with probability one under general assumptions

02

Applicable to broader class of mappings than traditional contractions

03

Provides theoretical foundation for reinforcement learning algorithms

Abstract

We apply methods of the fixed point theory to a Lambda policy iteration with a randomization algorithm for weak contractions mappings. This type of mappings covers a broader range than the strong contractions typically considered in the literature, such as \'Ciri\'c contraction. Specifically, we explore the characteristics of reinforcement learning procedures developed for feedback control within the context of fixed point theory. Under relatively general assumptions, we identify the sufficient conditions for convergence with a probability of one in infinite-dimensional policy spaces.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematical Approximation and Integration