Reinforcement Learning

Olivier Buffet; Olivier Pietquin; Paul Weng

arXiv:2005.14419·cs.LG·June 16, 2020·A Guided Tour of Artificial Intelligence Research

Reinforcement Learning

Olivier Buffet, Olivier Pietquin, Paul Weng

PDF

Open Access 1 Repo 1 Video

TL;DR

Reinforcement learning (RL) is a versatile framework for adaptive decision-making, with methods like value-based and policy search approaches, and extensions for risk-averse scenarios and unknown rewards, applicable in diverse domains.

Contribution

This chapter provides a comprehensive overview of RL fundamentals, main approaches, and recent extensions, highlighting the diversity and adaptability of RL methods.

Findings

01

RL effectively applied in games and autonomous vehicles

02

Two main RL approaches: value-based and policy search

03

Extensions include risk-averse RL and scenarios with unknown rewards

Abstract

Reinforcement learning (RL) is a general framework for adaptive control, which has proven to be efficient in many domains, e.g., board games, video games or autonomous vehicles. In such problems, an agent faces a sequential decision-making problem where, at every time step, it observes its state, performs an action, receives a reward and moves to a new state. An RL agent learns by trial and error a good policy (or controller) based on observations and numeric reward feedback on the previously performed action. In this chapter, we present the basic framework of RL and recall the two main families of approaches that have been developed to learn a good policy. The first one, which is value-based, consists in estimating the value of an optimal policy, value from which a policy can be recovered, while the other, called policy search, directly works in a policy space. Actor-critic methods can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Kikumu/Reinforcement-Learning-repo
none

Videos

Reinforcement Learning· youtube

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Advanced Bandit Algorithms Research