Benchmarking for Bayesian Reinforcement Learning

Michael Castronovo; Damien Ernst; Adrien Couetoux; Raphael Fonteneau

arXiv:1509.04064·cs.AI·September 28, 2016

Benchmarking for Bayesian Reinforcement Learning

Michael Castronovo, Damien Ernst, Adrien Couetoux, Raphael Fonteneau

PDF

TL;DR

This paper introduces a comprehensive benchmarking methodology and open-source library for Bayesian Reinforcement Learning algorithms, enabling rigorous comparison across diverse MDPs and prior distributions.

Contribution

It provides a new benchmarking framework and open-source tools for evaluating BRL algorithms, addressing the lack of extensive comparative studies in the field.

Findings

01

Comparison of seven state-of-the-art RL algorithms on benchmark problems.

02

Analysis of computation time requirements for non-anytime algorithms.

03

Discussion of algorithm performance across different prior distributions.

Abstract

In the Bayesian Reinforcement Learning (BRL) setting, agents try to maximise the collected rewards while interacting with their environment while using some prior knowledge that is accessed beforehand. Many BRL algorithms have already been proposed, but even though a few toy examples exist in the literature, there are still no extensive or rigorous benchmarks to compare them. The paper addresses this problem, and provides a new BRL comparison methodology along with the corresponding open source library. In this methodology, a comparison criterion that measures the performance of algorithms on large sets of Markov Decision Processes (MDPs) drawn from some probability distributions is defined. In order to enable the comparison of non-anytime algorithms, our methodology also includes a detailed analysis of the computation time requirement of each algorithm. Our library is released with all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.