Should Models Be Accurate?

Esra'a Saleh; John D. Martin; Anna Koop; Arash Pourzarabi; Michael; Bowling

arXiv:2205.10736·cs.LG·May 24, 2022

Should Models Be Accurate?

Esra'a Saleh, John D. Martin, Anna Koop, Arash Pourzarabi, Michael, Bowling

PDF

Open Access

TL;DR

This paper questions the necessity of accurate environment models in model-based reinforcement learning, proposing a meta-learning approach that emphasizes usefulness over accuracy, leading to faster learning in non-stationary environments.

Contribution

It introduces a meta-learning algorithm that trains models based on their usefulness for learning, rather than their accuracy in simulating environment dynamics.

Findings

01

Faster learning in non-stationary environments with the proposed method

02

Accurate models are not always necessary or most useful

03

Meta-learning improves model utility for planning

Abstract

Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in MBRL will inevitably be imperfect, and their detrimental effects on learning can be difficult to mitigate. In this work, we question whether the objective of these models should be the accurate simulation of environment dynamics at all. We focus our investigations on Dyna-style planning in a prediction setting. First, we highlight and support three motivating points: a perfectly accurate model of environment dynamics is not practically achievable, is not necessary, and is not always the most useful anyways. Second, we introduce a meta-learning algorithm for training models with a focus on their usefulness to the learner instead of their accuracy in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Reinforcement Learning in Robotics · Machine Learning and Data Classification