Reinforcement Learning: Stochastic Approximation Algorithms for Markov   Decision Processes

Vikram Krishnamurthy

arXiv:1512.07669·math.OC·December 25, 2015·1 cites

Reinforcement Learning: Stochastic Approximation Algorithms for Markov Decision Processes

Vikram Krishnamurthy

PDF

Open Access

TL;DR

This paper discusses stochastic approximation algorithms in reinforcement learning for Markov decision processes, highlighting their potential use in partially observed scenarios and providing a concise overview of their principles.

Contribution

It offers a clear and concise description of stochastic approximation algorithms applicable to reinforcement learning in Markov decision processes, including partially observed cases.

Findings

01

Algorithms can be used as suboptimal methods for partially observed MDPs.

02

Provides a concise overview of stochastic approximation techniques in reinforcement learning.

03

Highlights potential applications in complex decision-making scenarios.

Abstract

This article presents a short and concise description of stochastic approximation algorithms in reinforcement learning of Markov decision processes. The algorithms can also be used as a suboptimal method for partially observed Markov decision processes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications · Advanced Multi-Objective Optimization Algorithms