Correlation Priors for Reinforcement Learning

Bastian Alt; Adrian \v{S}o\v{s}i\'c; Heinz Koeppl

arXiv:1909.05106·cs.LG·October 29, 2019

Correlation Priors for Reinforcement Learning

Bastian Alt, Adrian \v{S}o\v{s}i\'c, Heinz Koeppl

PDF

TL;DR

This paper introduces a Bayesian framework using Pólya-Gamma augmentation to model correlation structures in discrete environments for reinforcement learning, improving predictive performance with less data.

Contribution

It provides a novel principled method for capturing correlations in discrete decision-making environments, which was previously lacking.

Findings

01

Outperforms correlation-agnostic models in predictive tasks

02

Effective in imitation learning, subgoal extraction, and system identification

03

Requires significantly less training data for comparable accuracy

Abstract

Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision process model, for example, two distinct states can have inherently related semantics or encode resembling physical state configurations. This often implies locally correlated transition dynamics among the states. In order to complete a certain task in such environments, the operating agent usually needs to execute a series of temporally and spatially correlated actions. Though there exists a variety of approaches to capture these correlations in continuous state-action domains, a principled solution for discrete environments is missing. In this work, we present a Bayesian learning framework based on P\'olya-Gamma augmentation that enables an analogous reasoning in such cases. We demonstrate the framework on a number of common…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.