Selective Dyna-style Planning Under Limited Model Capacity

Zaheer Abbas; Samuel Sokota; Erin J. Talvitie; Martha White

arXiv:2007.02418·cs.LG·March 9, 2021·6 cites

Selective Dyna-style Planning Under Limited Model Capacity

Zaheer Abbas, Samuel Sokota, Erin J. Talvitie, Martha White

PDF

Open Access 1 Video

TL;DR

This paper explores selective planning in model-based reinforcement learning by combining uncertainty estimates from heteroscedastic regression and parameter uncertainty to improve planning effectiveness with imperfect models.

Contribution

It introduces a method to estimate predictive uncertainty from model inadequacy using heteroscedastic regression, complementing existing parameter uncertainty approaches for selective planning.

Findings

01

Heteroscedastic regression signals model inadequacy effectively.

02

Combining uncertainty sources improves selective planning.

03

Model inadequacy detection complements parameter uncertainty methods.

Abstract

In model-based reinforcement learning, planning with an imperfect model of the environment has the potential to harm learning progress. But even when a model is imperfect, it may still contain information that is useful for planning. In this paper, we investigate the idea of using an imperfect model selectively. The agent should plan in parts of the state space where the model would be helpful but refrain from using the model where it would be harmful. An effective selective planning mechanism requires estimating predictive uncertainty, which arises out of aleatoric uncertainty, parameter uncertainty, and model inadequacy, among other sources. Prior work has focused on parameter uncertainty for selective planning. In this work, we emphasize the importance of model inadequacy. We show that heteroscedastic regression can signal predictive uncertainty arising from model inadequacy that is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Selective Dyna-style Planning Under Limited Model Capacity· slideslive

Taxonomy

TopicsArtificial Intelligence in Games · AI-based Problem Solving and Planning · Robotic Path Planning Algorithms