Hardness in Markov Decision Processes: Theory and Practice

Michelangelo Conserva; Paulo Rauber

arXiv:2210.13075·cs.LG·October 25, 2022

Hardness in Markov Decision Processes: Theory and Practice

Michelangelo Conserva, Paulo Rauber

PDF

Open Access 1 Video

TL;DR

This paper systematically surveys the theory of environment hardness in reinforcement learning, introduces a new benchmark and software tool for empirical analysis, and evaluates agents to advance understanding of hardness in tabular RL environments.

Contribution

It provides a comprehensive survey of hardness theory, introduces Colosseum for empirical hardness analysis, and benchmarks agents to validate hardness measures in tabular RL.

Findings

01

Hardness measures are generally applicable across environments.

02

Benchmarking reveals differences in agent performance related to environment hardness.

03

The Colosseum package facilitates standardized hardness evaluation.

Abstract

Meticulously analysing the empirical strengths and weaknesses of reinforcement learning methods in hard (challenging) environments is essential to inspire innovations and assess progress in the field. In tabular reinforcement learning, there is no well-established standard selection of environments to conduct such analysis, which is partially due to the lack of a widespread understanding of the rich theory of hardness of environments. The goal of this paper is to unlock the practical usefulness of this theory through four main contributions. First, we present a systematic survey of the theory of hardness, which also identifies promising research directions. Second, we introduce Colosseum, a pioneering package that enables empirical hardness analysis and implements a principled benchmark composed of environments that are diverse with respect to different measures of hardness. Third, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Hardness in Markov Decision Processes: Theory and Practice· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Supply Chain and Inventory Management