Before we can find a model, we must forget about perfection

Dimiter Dobrev

arXiv:1912.04964·cs.LG·December 5, 2022

Before we can find a model, we must forget about perfection

Dimiter Dobrev

PDF

Open Access

TL;DR

This paper argues that pursuing a perfect world model in reinforcement learning is impractical and proposes focusing on multiple simple Event-Driven models as a more feasible alternative.

Contribution

It introduces Event-Driven models as a generalization of MDPs and advocates for searching multiple simple models instead of a single perfect one.

Findings

01

Perfect models are too complex and impossible to find.

02

Event-Driven models generalize MDPs for better practicality.

03

Using multiple simple models improves understanding of the environment.

Abstract

With Reinforcement Learning we assume that a model of the world does exist. We assume furthermore that the model in question is perfect (i.e. it describes the world completely and unambiguously). This article will demonstrate that it does not make sense to search for the perfect model because this model is too complicated and practically impossible to find. We will show that we should abandon the pursuit of perfection and pursue Event-Driven (ED) models instead. These models are generalization of Markov Decision Process (MDP) models. This generalization is essential because nothing can be found without it. Rather than a single MDP, we will aim to find a raft of neat simple ED models each one describing a simple dependency or property. In other words, we will replace the search for a singular and complex perfect model with a search for a large number of simple models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making